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MAMMALIAN EDG-5 RECEPTOR HOMOLOGS 

FIELD OF THE INVENTION 

The present invention is in the field of molecular biology; more particularly, the 
present invention describes a nucleic acid sequence and an amino acid sequence for novel 
mammalian, including human, EDG-5 receptor homologs. 

BACKGROUND OF THE INVENTION 



The family of edg receptors are commonly grouped with orphan receptors because 
their endogenous ligands are not known (for example see Hla T and Maciag T (1990) J Biol 
Chem. 265:9308-13; US 5,585,476). Recently, however, lysophospatidic acid has been 
demonstrated to be the endogenous ligand for the edg-2 receptor (Hecht et al. (1996) J. Cell. 
15 Biol. 135: 1071-1083; An et al. (1997) Biochem. Biophys. Res. Comm. 213: 619-622). 

The edg family of receptors is seven transmembrane G protein coupled receptors 
(T7Gs). T7Gs are so named because of their seven hydrophobic domains, which span the 
plasma membrane and form a bundle of antiparallel a helices. These transmembrane 

20 segments (TMS) are designated by roman numerals I- VII and account for structural and 
functional features of the receptor. In most cases, the bundle of helices forms a binding 
pocket; however, when the binding site must accommodate more bulky molecules, the 
extracellular N-terminal segment or one or more of the three extracellular loops participate in 
binding and in subsequent induction of conformational change in intracellular portions of the 

25 receptor. Specifically: the TM-VII is generally a highly conserved portion of the T7G 
receptors, and is often critically involved in ligand binding and receptor activation, the 
intracellular carboxy-terminal is involved in interactions with intracellular proteins, including 
those that transduce intracellular signals upon receptor activations; the carboxy-terminal is 
usually hydrophilic and highly antigenic relative to the receptor polypeptide as a whole and 

30 shows greatly reduced conservation. 

Once the receptor is activated, it interacts with an intracellular G-protein complex 
which mediates further intracellular signaling activities generally the production of second 
messengers such as cyclic AMP (cAMP), phospholipase C, inositol triphosphate, activation 
35 of protein kinases, alteration in the expression of specific genes. 



1 
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T7G receptors are expressed and activated during numerous developmental and 
disease processes. Identification of a novel T7G receptor provides the opportunity to diagnose 
or intervene in such processes, and the receptor can be used in screening assays to identify 
5 physiological or pharmaceutical molecules, which trigger, prolong or inhibit its activity or 
differentially modulate distinct intracellular pathways that are controlled from T7G receptors. 

SUMMARY OF THE INVENTION 

10 The invention provides isolated and unique nucleotide sequences which encode novel 

mammalian receptor homologs EDG-5, including murine EDG-5 (MEDG-5) and human 
EDG-5 (HEDG-5-5). Herein, the nucleotide sequence encoding MEDG-5 and HEDG-5 is 
designated medg-5 and hedg-5, respectively. 

15 The present invention also relates to the isolated and unique nucleotide sequences of 

the complement of edg-5 mRNA, particularly hedg-5. In addition, the invention features 
nucleotide sequences which hybridize under stringent conditions to edg-5, particularly, hedg- 
5. 

20 In addition, the present invention relates to expression vectors and host cells 

comprising such edg-5 nucleotide sequences. 

More particularly, the present invention provides fragments which are useful as 
antibodies for EDG-5, for example fragments in the TM-VII and carboxy-terminal domain. 

25 

Furthermore, the invention relates to the use of nucleic acid and amino acid sequences 
of mammalian EDG-5, and more particularly, to the use of HEDG-5, or its variants, in the 
diagnosis or treatment of diseased cells and/or tissues associated with aberrant expression of 
HEDG-5. 

30 

Additional aspects of the invention are directed to the EDG-5 receptor, but more 
particularly, the HEDG-5 receptor, and include: the antisense DNA of edg-5/hedg-5; cloning 
or expression vectors containing edg-5/hedg-5; host cells or organisms transformed with 
expression vectors containing edg-5/hedg-5; chromosomal localization of hedg-5; expression 

2 
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and tissue distribution of edg-5/hedg-5; a method for the production and recovery of purified 
EDG-5/HEDG-5 from host cells; purified protein, EDG-5/HEDG-5, which can be used to 
identify inhibitors for the downregulation of signal transduction involving EDG-5/HEDG-5; 
and methods of screening for ligands of edg-5/hedg-5 using transformed cells. 

5 

Particularly there is provided an isolated nucleotide sequence selected from the group 
consisting of: 

(a) the nucleotide sequence comprising nucleotides 36-10974 of SEQ. ID NO: 13 
(Figure 3 A) (b) the nucleotide sequence of Figure 3B; 
10 (c) the nucleotide sequence of Figure 3C; 

(d) the nucleotide sequence comprising at least about 70% sequence identity to (a), 
(b) or (c), more preferably at least about 80-85% sequence identity, and even more preferably 
at least about 90% sequence identity, and most preferably at least about 95% sequence 
identity, and which nucleotide sequence hybridizes under stringent conditions to the 

1 5 nucleotide sequence of (a), (b) or (c), respectively; or portions thereof, and 

(e) the nucleotide sequence which encodes the amino acid sequence of Figure 4 A 
(SEQ ID NO. 14), 4B or 4C. There is also provided: expression vectors; host cells; purified 
amino acid sequences; complementary nucleic acid sequences; biologically active fragments; 
and hybridization probes, for such nucleotide sequences and their encoded amino acid 

20 sequences. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A shows a partial DNA sequence of clone 501 which is a murine edg-5 clone 
25 (SEQ ID NO: 3). 

Figure IB shows the full length DNA sequence of the a subclone of a mhedg-5 
pBluescript subclone and the predicted amino acid sequence thereof. 

30 Figure 2 shows the amino acid sequence encoded by the DNA sequence of Figure 1 A 

(SEQ ID NO: 15) 

Figure 3 A shows a nucleotide sequence of hedg-5 cDNA inserted into pcDNA3, 
nucleotides 36-1097 of which encode the full length HEDG-5. (pC3-hEdg5-3) 

3 
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Figure 3B shows a nucleotide sequence of hedg-5 cDNA of clone pC3-hEdg5#3.4, 
which encodes the full length HEDG-5. 

Figure 3C shows a nucleotide sequence of hedg-5 cDNA of clone pc3-hEdg5#28, 
5 which encodes the full length HEDG-5. 

Figure 4 A shows an alignment of the genomic DNA of Figure 3 A (which corresponds 
to the cDNA of the pC3-hEdg5-3 from nt 251-1523 and the genomic DNA flanking from nt 
1-250) with the predicted amino acid sequence. 

10 

Figure 4B shows the predicted amino acid sequence of hedg-5 cDNA of Figure 3B. 

Figure 4C shows the predicted amino acid sequence of hedg-5 cDNA of Figure 3C. 

15 Figure 5 A shows the alignment of the predicted amino acid sequences of HEDG5 

translation products of clones pC3-hedg5-3 , pC3-hedg5#4, and pC3-hedg5#28 as set out in 
Figures 4A, 4B and 4C, respectively. 

Figure 5B shows the alignment of the amino acid sequence of murine edg-5 with the 
20 amino acid sequence of human edg-5 from the pC3-hEdg5#3.4 clone. 

Figure 6 shows the functional response of the pC3-hedg5#4, pC3-hedg5-3 and pC3- 
hedg5#28 clones to anandamide and to LPA by activation of NF-kB production. 

25 Figure 7 shows the SRE response and AP-1 response of pC3-hedg5#28 when treated 

with 10[iMLPA. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The invention relates in one respect to polynucleotides, in their isolated form, that 

code for mammalian, including murine and human, EDG-5 receptors. The EDG receptors are 
characterized by structural features common to the G-protein coupled receptor class, 

4 
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including seven transmembrane regions, and by the functional properties of binding 
lysophospholipid selectively. When expressed functionally in a host cell, i.e., in operable 
linkage with a responsive second messenger system the EDG-5 receptors are capable further 
of responding to lysophospholipid binding by signal transduction. In this regard, the activity 
5 of a G-protein coupled receptor such as an EDG-5 receptor can be measured using any of a 
variety of appropriate functional assays described hereinbelow. 

As used herein and designated by the upper case abbreviation, EDG-5, refers to 
mammalian EDG-5 receptor homolog in either naturally occurring or synthetic form and 
10 active fragments thereof and the lower case edg-5 to the nucleotide sequence thereof The 
mammalian receptor, EDG-5, are characterised by structural features common to the G- 
protein coupled receptors, including the seven transmembrane regions, and by the sequence 
identity to each other of greater than about 56%, more preferably greater than about 70% 
identity, and most preferably greater than about 80% identity. 

15 

Furthermore, as used herein, the human EDG-5 receptor is designated as HEDG-5 and 
the nucleotide sequence as hedg-5 and the murine EDG-5 receptor is designated as MEDG-5 
and the nucleotide sequence as medg-5. 

20 The novel murine hedg-5 sequence was isolated following PCR from a murine 

neuronal cell line using degenerate primers based on conserved regions of transmembrane 
domains (TM-2) and TM-7 of the G protein-coupled receptor (GPCR) superfamily. Sequence 
comparison with known sequences demonstrated that this mouse clone represented a gene 
related to, but not identical to edg-2, an orphan GPCR. Sequence identity was 49% at the 

25 nucleotide level. In the studies detailed herein the hedg-5 sequence was used, however, these 
studies and the applications detailed herein could be undertaken using the novel mouse edg-5 
sequence disclosed herein. 

A full-length mouse sequence is obtained using methods well known to those of skill 
30 in the art. For example, by screening an arrayed mouse library (Genome Systems Inc.) using 
the full-length human edg-5 cDNA. The hedg-5 sequence is first radiolabeled using the 
condon priming method and then hybridized to the PAC filters and washed at high 
stringency, with the final wash done for 30 min at 65 C in IX SSC. Genomic DNA inserts 

5 
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from the clones with the strongest signals can be shotgun subcloned into pBluescript or a 
comparable cloning vector, using at least 3 different restriction digests of which 1 should 
have a 4 bp recognition site. Each digest yields a different subclone library, which in turn can 
be screened with the same cDNA probe under the same stringency conditions. Positives are 

5 picked, grown, mapped by restriction digest and Southern blotting to identify the size of the 
hybridizing insert, then sequenced using primers based on either the vector sequence, or on 
human edg-5 sequences. The position of the single intron seen in the human edg-5 gene 
should be conserved in the mouse gene. Thus, primers can be designed with a high degree of 
confidence to obtain the complete coding sequence of the mouse edg-5 gene without 

1 0 including intron sequences. Once the coding region has been determined, new PCR primers 
can be designed to amplify the cDNA directly from various tissue and/or cell line sources. A 
more detailed description of this approach can be found in Maniatis et al. Molecular Cloning, 
a Laboratory Manual (Cold Spring Harbor Press, 1 989). 

1 5 All publications and patent applications mentioned herein are incorporated by 

reference for the purpose of describing the methodologies, cell lines and vectors, among other 
things. However, nothing herein is to be construed as an admission that the invention is not 
entitled to antedate such disclosure, for example, by virtue of prior invention. 



20 Definitions 

The following definitions are used herein for the purpose of describing particular 
terms used in the application. Any terms not specifically defined should be given the 
meaning commonly understood by one of oridinary skill in the art to which the invention 
pertains. 

25 

As used herein "isolated" means separated from polynucleotides that encode other 
proteins. In the context of polynucleotide libraries, for instance, a EDG-5 receptor-encoding 
polynucleotide is considered "isolated" when it has been selected, and hence removed from 
association with other polynucleotides within the library. Such polynucleotides may be in the 
30 form of RNA, or in the form of DNA including cDNA, genomic DNA and synthetic DNA. 

As used herein "purified" refers to sequences that are removed from their natural 
environment, and are isolated or separated, and are at least 60% free, preferably 75 % free, 
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and most preferably 90% freee from other components with which they are naturally 
associated. 



An "oligonucleotide" is a stretch of nucleotide residues which has a sufficient number 
5 of bases to be used as an oligomer, amplimer or probe in a polymerase chain reaction (PCR). 
Oligonucleotides are prepared from genomic or cDNA sequence and are used to amplify, 
reveal or confirm the presence of a similar DNA or RNA in a particular cell or tissue. 
Oligonucleotides or oligomers comprise portions of a DNA sequence having at least about 10 
nucleotides and as many as about 35 nucleotides, preferably about 25 nucleotides. 

10 

"Probes" may be derived from naturally occurring or recombinant single - or double - 
stranded nucleic acids or be chemically synthesized. They are useful in detecting the 
presence of identical or similar sequences. 

15 A "portion" or "fragment" of a polynucleotide or nucleic acid comprises all or any 

part of the nucleotide sequence having fewer nucleotides than about 6 kb, preferably fewer 
than about 1 kb. A portion or fragment can be used as a probe. Such probes may be labeled 
with reporter molecules using nick translation, Klenow fill-in reaction, PCR or other methods 
well known in the art. To optimize reaction conditions and to eliminate false positives, 

20 nucleic acid probes may be used in Southern, Northern or in situ hybridizations to determine 
whether DNA or RNA encoding HEDG-5 is present in a cell type, tissue, or organ. 

"Reporter" molecules are those radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents which associate with, establish the presence of, 
25 and may allow quantification of a particular nucleotide or amino acid sequence. 

"Recombinant nucleotide variants" encoding HEDG-5 may be synthesized by making 
use of the "redundancy" in the genetic code. Various codon substitutions, such as the silent 
changes which produce specific restriction sites or codon usage-specific mutations, may be 
30 introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
prokaryotic or eukaryotic host system, respectively. 
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"Chimeric" molecules may be constructed by introducing all or part of the nucleotide 
sequence of this invention into a vector containing additional nucleic acid sequence which 
might be expected to change any one (or more than one) of the following HEDG-5 
characteristics: cellular location, distribution, ligand-binding affinities, interchain affinities, 
degradation/turnover rate, signaling, etc. 

"Biologically Active or Active" refers to those forms, fragments, or domains of any 
HEDG-5 polypeptide which retain at least some of the biological and/or antigenic activities 
of any naturally occurring HEDG-5. 

"Naturally occurring HEDG-5" refers to a polypeptide produced by cells which have 
not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including but not limited to 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 



"Derivative" refers to those amino acid and nucleotide sequences which have been 
chemically modified. Such techniques for amino acid deravitives include: ubiquitination; 
labeling (see above); pegylation (derivatization with polyethylene glycol); and chemical 
insertion or substitution of amino acids such as ornithine which do not normally occur in 
20 human proteins. A nucleotide sequence derivative would encode the amino acid which 
retains its essential biological characteristics of the natural molecule. 

"Recombinant polypeptide variant" refers to any polypeptide which differs from 
naturally occurring HEDG-5 by amino acid insertions, deletions and/or substitutions, created 
25 using recombinant DNA techniques. Guidance in determining which amino acid residues may 
be replaced, added or deleted without abolishing activities of interest may be found by 
comparing the sequence of HEDG-5 with that of related polypeptides and minimizing the 
number of amino acid sequence changes made in highly conserved regions. 

30 Amino acid "substitutions" are conservative in nature when they result from replacing 

one amino acid with another having similar structural and/or chemical properties, such as the 
replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, or a 
threonine with a serine. 



8 
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"Insertions" or "deletions" are typically in the range of about 1 to 5 amino acids. The 
variation allowed may be experimentally determined by producing the peptide synthetically 
or by systematically making insertions, deletions, or substitutions of nucleotides in the hedg-5 
5 sequence using recombinant DNA techniques. 

A "signal or leader sequence" can be used, when desired, to direct the polypeptide 
through a membrane of a cell. Such a sequence may be naturally present on the polypeptides 
of the present invention or provided from heterologous sources by recombinant DNA 
10 techniques. 

An "oligopeptide" is a short stretch of amino acid residues and may be expressed from 
an oligonucleotide. It may be functionally equivalent to and the same length as (or 
considerably shorter than) a "fragment", "portion", or "segment" of a polypeptide. Such 
15 sequences comprise a stretch of amino acid residues of at least about 5 amino acids and often 
about 17 or more amino acids, typically at least about 9 to 13 amino acids, and of sufficient 
length to display biological and/or antigenic activity. 

"Inhibitor" is any substance which retards or prevents a biochemical, cellular or 
20 physiological reaction or response. Common inhibitors include but are not limited to 
antisense molecules, antibodies, and antagonists. 

"Standard" is a quantitative or qualitative measurement for comparison. It is based 
on a statistically appropriate number of normal samples and is created to use as a basis of 
25 comparison when performing diagnostic assays, running clinical trials, or following patient 
treatment profiles. 

"Stringent conditions" is used herein to mean conditions that allow for hybridization 
of substantially related nucleic acid sequences. . Such hybridization conditions are described 
30 by Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor 
Press, 1989. Generally, stringency occurs within a range from about 5 °C below the melting 
temperature of the probe to about 20 °C - 25 °C below the melting temperature. As 
understood by ordinary skilled persons in the art, the stringency conditions may be altered in 
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order to identify or detect identical or related nucleotide sequences. Factors such as the 
length and nature (DNA, RNA, base composition) of the sequence, nature of the target (DNA, 
RNA, base composition, presence in solution or immobilization, etc.) and the concentration 
of the salts and other componenets (e.g. the presence or absence of formamide, dextran 
5 sulfate and/or polyethylene glycol) are considered and the hybridization solution may be 
varied to generate conditions of either low or high stringency. 

"Animal" as used herein may be defined to include human, domestic (cats dogs, etc.), 
agricultural (cows, horses, sheep, etc.) or test species (mouse, rat, rabbit, etc.). 

10 

"Nucleotide sequences" as used herein are oligonucleotides, polynucleotides, and 
fragments or portions thereof, and are DNA or RNA of genomic or synthetic origin which 
may be single or double stranded, and represent the sense or complement or antisense strands. 

"Sequence identity" is known in the art, and is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined y comparing 
the sequences, particularly, as determined by the match between strings of such sequences. 
Sequence identity can be readily calculated by known methods (Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1 993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., Humana 
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic 
Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991). While there exist a number of methods to measure identity between 
two sequences, the term is well known to skilled artisans (see, for example, Sequence 
Analysis in Molecular Biology; Sequence Analysis Primer; and Carillo, H., and Lipman, D., 
SIAM J. Applied Math., 48: 1073 (1988)). Methods commonly employed to determine 
identity between sequences include, but are not limited to those disclosed in Carillo, H., and 
Lipman, D., SIAM J. Applied Math., 48: 1073 (1988) or, preferably, in Needleman and 
Wunsch, J. Mol. Biol., 48: 443-445, 1970, wherein the parameters are as set in version 2 of 
DNASIS (Hitachi Software Engineering Co., San Bruno, CA). Computer programs for 
determining identity are publicly available. Preferred computer program methods to 
determine identity between two sequences include, but are not limited to, GCG program 

10 
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package (Devereux, J., et al, Nucleic Acids Research 12(1): 387 (1984)), BLASTP, 
BLASTN, and FASTA (Atschul, S.F. et al., J. Molec. Biol 215: 403-410 (1990)). The 
BLASTX program is publicly available from NCBI ( blast@ncbi.nlm.nih.gov ) and other 
sources (BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; 
Altschul, S., et al., J. Mol. Bio. 215: 403-410 (1990)). Computational Molecular Biology, 
Lesk, A.M, ed. Unless specified otherwise in the claims, the percent identity for the purpose 
of interpreting the claims shall be calculated by the Needleman and Wucnsch algorithm with 
the parameters set in version 2 of DNASIS. 

The present invention provides a nucleotide sequence uniquely identifying novel 
mammalian, including murine (MEDG-5) and human (HEDG-5), seven transmembrane 
receptor (T7G) or EDG-5. 

Based on the homology of HEDG-5 to human edg-2 (see table 2 below) it is likely 
that HEDG-5 binds a ligand of the same chemical class. Edg-2 specifically binds 
lysophosphatidic acid (LP A) which is a phospholipid. It was determined herein that HEDG-5 
also recognizes LPA as a functional agonist. 

Phospholipids have been demonstrated to be important regulators of cell activity, 
including mitogenisis (Xu et al. (1995) J. Cell. Physiol, 163: 441-450) and apoptosis, cell 
adhesion and regulation of gene expression. Specifically, for example, LPA elicits growth 
factor-like effects on cell proliferation (Moolenar (1996) J. Biol. Chem, 270: 12949-12952) 
and cell migration (Imamura et al. (1993) Biochem. Biophys. Res. Comm., 193: 497-503). It 
has also been suggested that LPA plays a role in wound healing and regeneration (Tigyi et al. 
(1992) J. Biol. Chem., 267: 21360-21367). Further, considerable circumstantial evidence 
indicates that phospholipids may be involved in various disease states including cancer 
(Imamura et al., (1993) Biochem. Biophys. Res. Comm., 193: 497-503); diseases having an 
inflammatory component (Fourcade et al. (1995), Cell, 80(6): 919-927, including adult 
respiratory distress, neurodegeneration (Jalink et al. (1993) Cell Growth Differ., 4: 247-255), 
rheumatoid arthritis (Natiarajan et al. (1995) J. Lipid Res., 36(9): 2005-2016), psoriasis and 
inflammatory bowel disease. Thus, ligands for HEDG-5, including LPA, are likely to be 
biologically important regulators of cell activity, and therefore aberrant expression or activity 
of HEDG-5 is likely to be associated with a chronic or acute disease states. Further, 
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modulators of HEDG-5 activity are likely to be useful in treatment or prevention of such 
disease states. 

HEDG-5 ligands, other than LP A, are likely to be found among the phospholipid class 
5 of compounds.. Therefore, in one embodiment, preferably phospholipid molecules should be 
screened to identify HEDG-5 ligands. Even more preferably, lysoglycerophospholipids 
should be screened, such as lysophosphatidylethanolamine (LPE), lysophosphatidylserine 
(LPS), lysophosphatidylinositol (LPI), lysophosphatidylcholine (LPC), lyso-platelet 
activating factor (lyso-PAF) and phosphatide acid. These ligands can be altered to improve 

10 metabolic stability, for example, by changing ester bond at Sn-1 to an ether on by blocking 
the free hydroxyl group with methoxy or acetyl ester. Additional medicinal chemistry 
benefits may be derived from shortening the fatty acid chain or altering the positioning of the 
phosphate. LPA and related phospholipids have limited solubility in aqueous solution and 
have a tendency to be sticky. These problems may be alleviated in a number of ways. For 

15 example, preparation of fresh stock solutions (e.g., 10 mM) by dissolving the phospholipid in 
calcium-free PBS and fatty-acid free BSA. Other related phospholipids can be prepared, for 
example, in 1 00% ethanol or DMSO 

A diagnostic test for aberrant expression of HEDG-5 can accelerate diagnosis and 
20 proper treatment of abnormal conditions of for example, the heart, kidney, lung and testis. 
Specific examples of conditions in which aberrant expression of HEDG-5 may play a role 
include adult respiratory distress, asthma, rheumatoid arthritis, cardiac ischemia, acute 
pancreatitis, septic shock, psoriasis, acute cyclosporine nephrotoxicity and early diabetic 
glomerulopathy, as well as lung damage following exposure to cigarette smoke, asbestos or 
25 silica. 

The nucleotide sequences encoding EDG-5 (or their complement) have numerous 
applications in techniques known to those skilled in the art of molecular biology. These 
techniques include use as hybridization probes, use in the construction of oligomers for PCR, 
30 use for chromosome and gene mapping, use in the recombinant production of EDG-5, and use 
in generation of antisense DNA or RNA, their chemical analogs and the like. Uses of 
nucleotides encoding EDG-5 disclosed herein are exemplary of known techniques and are not 
intended to limit their use in any technique known to a person of ordinary skill in the art. 

12 
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Furthermore, the nucleotide sequences disclosed herein may be used in molecular biology 
techniques that have not yet been developed, provided the new techniques rely on properties 
of nucleotide sequences that are currently known, e.g., the triplet genetic code, specific base 
pair interactions, etc. 

5 

It will be appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code, a multitude of EDG-5-encoding nucleotide sequences may be produced. 
Some of these will only bear minimal homology to the nucleotide sequence of the known and 
naturally occurring EDG-5. The invention has specifically contemplated each and every 
1 0 possible variation of nucleotide sequence that could be made by selecting combinations based 
on possible codon choices. These combinations are made in accordance with the standard 
triplet genetic code as applied to the nucleotide sequence of naturally occurring edg-5, and all 
such variations are to be considered as being specifically disclosed. 

1 5 Although the nucleotide sequences which encode EDG-5, its derivatives or its variants 

are preferably capable of hybridizing to the nucleotide sequence of the naturally occurring 
edg-5 under stringent conditions, it may be advantageous to produce nucleotide sequences 
encoding EDG-5 or its derivatives possessing a substantially different codon usage. Codons 
can be selected to increase the rate at which expression of the peptide occurs in a particular 

20 prokaryotic or eukaryotic expression host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the 
nucleotide sequence encoding EDG-5 and/or its derivatives without altering the encoded aa 
sequence include the production of RNA transcripts having more desirable properties, such as 
a greater half-life, than transcripts produced from the naturally occurring sequence. 

25 

Human genes often show considerable actual polymorphism; that is, variation in 
nucleotide sequence among a fraction of the entire human population. In many cases this 
polymorphism can result in one or more amino acid substitutions. While some of these 
substitutions show no demonstrable change in function of the protein, others may produce 
30 varying degrees of functional effects. In fact, many natural or artificially produced mutations 
can lead to expressible HEDG proteins. Each of these variants, whether naturally or 
artificially produced, is considered to be equivalent and specifically incorporated into the 
present invention. 
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Nucleotide sequences encoding EDG-5 may be joined to a variety of other nucleotide 
sequences by means of well established recombinant DNA techniques (Sambrook J et al 
5 (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 

Spring Harbor NY; or Ausubel FM et al (1989) Current Protocols in Molecular Biology, John 
Wiley & Sons, New York City). Useful nucleotide sequences for joining to edg-5 include an 
assortment of cloning vectors such as plasmids, cosmids, lambda phage derivatives, 
phagemids, and the like. Vectors of interest include expression vectors, replication vectors, 
10 probe generation vectors, sequencing vectors, etc. In general, vectors of interest may contain 
an origin of replication functional in at least one organism, convenient restriction 
endonuclease sensitive sites, and selectable markers for one or more host cell systems. 

Another aspect of the subject invention is to provide for edg-5-specific hybridization 

15 probes capable of hybridizing with naturally occurring nucleotide sequences encoding EDG- 
5. Such probes may also be used for the detection of similar T7G encoding sequences and 
should preferably contain at least 56% nucleotide identity, more preferably at least 70% 
identity, to edg-5 sequence. The hybridization probes of the subject invention may be derived 
from the nucleotide sequence presented as SEQ. ED NO: 12 or from genomic sequences 

20 including promoter, enhancers, introns or 3' -untranslated regions of the native gene. 

Hybridization probes may be labeled by a variety of reporter molecules using techniques well 
known in the art. Preferably, the hybridization probes incorporate at least 15 nucleotides, and 
preferably at least 25 nucleotides, of the edg-5 receptor, more particualrly of the medg-5 or 
the hedg-5 receptor. Suitable hybridization probes would include: consensus fragments, i.e. 

25 those regions of the mouse and human edg-5 receptor that are identical (See Figure 5B); the 
extracellular edg-5 binding domain, the stipulated transmembrane regions and the C-terminal 
portion of the receptor. 

It will be recognized that many deletional or mutational analogs of nucleic acid 
sequences for EDG-5 will be effective hybridization probes for EDG-5 nucleic acid. 

30 Accordingly, the invention relates to nucleic acid sequences that hybridize with such EDG-5 
encoding nucleic acid sequences under stringent conditions. 
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"Stringent conditions" refers to conditions that allow for the hybridization of 
substantially related nucleic acid sequences. For instance, such conditions will generally 
allow hybridization of sequence with at least about 70% identity, preferably with at least 80- 
85% sequence identity, more preferably with at least about 90% sequence identity, and even 
5: more preferably with at least about 95% sequence identity. Such hybridization conditions are 
described by Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold 
Spring Harbor Press, 1989. Hybridization conditions and probes can be adjusted in well- 
characterized ways to achieve selective hybridization of human-derived probes. Nucleic acid 
molecules that will hybridize to EDG-5 encoding nucleic acid under stringent conditions can 

10 be identified functionally, using methods outlined above, or by using for example the 

hybridization rules reviewed in Sambrook et al., Molecular Cloning: A Laboratory Manual, 
2nd ed., Cold Spring Harbor Press, 1989. Without limitation, examples of the uses for 
hybridization probes include: histochemical uses such as identifying tissues that express 
EDG-5; measuring mRNA levels, for instance to identify a sample's tissue type or to identify 

15 cells that express abnormal levels of EDG-5; and detecting polymorphisms in the EDG-5. 
RNA hybridization procedures are described in Maniatis et al. Molecular Cloning, a 
Laboratory Manual (Cold Spring Harbor Press, 1989). PCR as described US Patent No's. 
4,683,195; 4,800,195; and 4,965,188 provides additional uses for oligonucleotides based 
upon the nucleotide sequence which encodes the edg-5 sequences of the invention. Such 

20 probes used in PCR may be of recombinant origin, chemically synthesized, or a mixture of 
both. Oligomers may comprise discrete nucleotide sequences employed under optimized 
conditions for identification of edg-5 in specific tissues or diagnostic use. The same two 
oligomers, a nested set of oligomers, or even a degenerate pool of oligomers may be 
employed under less stringent conditions for identification of closely related DNA's or 

25 RNA's. Rules for designing PCR primers are now established, as reviewed by PCR 

Protocols, Cold Spring Harbor Press, 1991. Degenerate primers, i.e., preparations of primers 
that are heterogeneous at given sequence locations, can be designed to amplify nucleic acid 
sequences that are highly homologous to, but not identical to edg-5. Strategies are now 
available that allow for only one of the primers to be required to specifically hybridize with a 

30 known sequence. See, Froman et al., Proc. Natl. Acad. Sci. USA 85: 8998, 1988 and Loh et 
al., Science 243: 217, 1989. For example, appropriate nucleic acid primers can be ligated to 
the nucleic acid sought to be amplified to provide the hybridization partner for one of the 
primers. In this way, only one of the primers need be based on the sequence of the nucleic 
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acid sought to be amplified. PCR methods of amplifying nucleic acid will utilize at least two 
primers. One of these primers will be capable of hybridizing to a first strand of the nucleic 
acid to be amplified and of priming enzyme-driven nucleic acid synthesis in a first direction. 
The other will be capable of hybridizing the reciprocal sequence of the first strand (if the 

5 sequence to be amplified is single stranded, this sequence will initially be hypothetical, but 
will be synthesized in the first amplification cycle) and of priming nucleic acid synthesis from 
that strand in the direction opposite the first direction and towards the site of hybridization for 
the first primer. Conditions for conducting such amplifications, particularly under preferred 
stringent hybridization conditions, are well known. See, for example, PCR Protocols, Cold 

1 0 Spring Harbor Press, 1991. 

Other means of producing specific hybridization probes for edg-5 include the cloning 
of nucleic acid sequences encoding EDG-5 or EDG-5 derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available 
15 and may be used to synthesize RNA probes in vitro by means of the addition of the 

appropriate RNA polymerase as T7 or SP6 RNA polymerase and the appropriate reporter 
molecules. 

More particularly, a method for detection of polynucleotides that hybridize with hedg- 
20 7 is exemplified in Example 4, wherein a positive test correlates to approximately at least 
70% identity, and more preferably at least 80-85% sequence identity. 

It is possible to produce a DNA sequence, or portions thereof, entirely by synthetic 
chemistry. After synthesis, the nucleic acid sequence can be inserted into any of the many 
25 available DNA vectors and their respective host cells using techniques which are well known 
in the art. Moreover, synthetic chemistry may be used to introduce mutations into the 
nucleotide sequence. Alternately, a portion of sequence in which a mutation is desired can be 
synthesized and recombined with longer portion of an existing genomic or recombinant 
sequence. 



30 



The nucleotide sequence for hedg-5 can be used in an assay to detect inflammation or 
disease associated with abnormal levels of HEDG-5 expression. The cDNA can be labeled 
by methods known in the art, added to a fluid, cell or tissue sample from a patient, and 
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incubated under hybridizing conditions. After an incubation period, the sample is washed 
with a compatible fluid which optionally contains a reporter molecule. After the compatible 
fluid is rinsed off, the reporter molecule is quantitated and compared with a standard as 
previously defined. 

5" 

The nucleotide sequence for hedg-5 has been used to construct hybridization probes 
for mapping the native gene. The edg-5 gene was mapped to a band p22.3 of chromosome 1 
using bacterial artificial chromosomes isolated (BACs), as detailed in Example 16. Thus, the 
invention provides expression products from this locus that hybridize with hedg-5 (SEQ ID 
1 0 NO: 1 2) under stringent conditions. In situ hybridization of chromosomal preparations and 
physical mapping techniques such as linkage analysis using established chromosomal 
markers are invaluable in extending genetic maps. Examples of genetic map data can be 
found in the yearly genome issue of Science (e.g. 1994, 265:1981f). 

15 New nucleotide sequences can be assigned to chromosomal subregions by physical 

mapping. The mapping of new genes or nucleotide sequences provide useful landmarks for 
investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once a disease or syndrome, such as ataxia telangiectasia (AT), has been crudely 
localized by genetic linkage to a particular genomic region, for example, AT to 1 lq22-23 

20 (Gatti et al (1988) Nature 336:577-580), any sequences mapping to that area may represent or 
reveal genes for further investigation. The nucleotide sequence of the subject invention may 
also be used to detect differences in gene sequence between normal and carrier or affected 
individuals. 

25 Nucleotide sequences encoding edg-5 may be used to produce a purified oligo - or 

polypeptide using well known methods of recombinant DNA technology. Goeddel (1990, 
Gene Expression Technology, Methods and Enzymology, Vol. 185, Academic Press, San 
Diego CA) is one among many publications which teach expression of an isolated nucleotide 
sequence. The oligopeptide may be expressed in a variety of host cells, either prokaryotic or 

30 eukaryotic. Host cells may be from the same species from which the nucleotide sequence 
was derived or from a different species. Advantages of producing an oligonucleotide by 
recombinant DNA technology include obtaining adequate amounts of the protein for 
purification and the availability of simplified purification procedures. 
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Cells transformed with DNA encoding EDG-5 may be cultured under conditions 
suitable for the expression of T7Gs, their extracellular, transmembrane or intracellular 
domains and recovery of such peptides from cell culture. EDG-5 (or any of its domains) 
5 produced by a recombinant cell may be secreted, expressed on cellular membranes, or may be 
contained intracellular^, depending on the particular genetic construction used. In general, it 
is more convenient to prepare recombinant proteins in secreted form. Purification steps vary 
with the production process and the particular protein produced. Often an oligopeptide can be 
produced from a chimeric nucleotide sequence. This is accomplished by ligating the 
1 0 nucleotides from edg-5 or a desired portion of the polypeptide to a nucleic acid sequence 
encoding a polypeptide domain which will facilitate protein purification (Kroll DJ et al 
(1993) DNA Cell Biol. 12:441-53). 

In addition to recombinant production, fragments of EDG-5 may be produced by 
15 direct peptide synthesis using solid-phase techniques (e.g. Stewart at al (1969) Solid-Phase 
Peptide Synthesis, WH Freeman Co., San Francisco QA; Merrifield J (1963) J Am Chem. 
Soc. 85:2149-2154). Automated synthesis may be achieved, for example, using Applied 
Biosy stems 431 A Peptide Synthesizer (Foster City, CA) in accordance with the instructions 
provided by the manufacturer. Additionally, a particular portion of EDG-5 may be mutated 
20 during direct synthesis and combined with other parts of the peptide using chemical methods. 

EDG-5 for antibody induction does not require biological activity: however, the 
protein must be antigenic. Peptides used to induce specific antibodies may have an aa 
sequence consisting of at least five amino acids (aa), preferably at least 10 aa. They should 
25 mimic a portion of the aa sequence of the protein and may contain the entire aa sequence of a 
small naturally occurring molecule such as EDG-5. An antigenic portion of EDG-5 may be 
fused to another protein such as keyhole limpet hemocyanin, and the chimeric molecule used 
for antibody production. 

30 Antibodies specific for EDG-5 may be produced by inoculation of an appropriate 

animal with the polypeptide or an antigenic fragment. An antibody is specific for EDG-5 if it 
is produced against an epitope of the polypeptide and binds to at least part of the natural or 
recombinant protein. Antibody production includes not only the stimulation of an immune 
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response by injection into animals, but also analogous processes such as the production of 
synthetic antibodies, the screening of recombinant immunoglobulin libraries for specific- 
binding molecules (e.g. Orlandi R et al (1989) PNAS 86:3833-3837, or Huse WD et al (1989) 
Science 256:1275-1281) or the in vitro stimulation of lymphocyte populations. Current 
5 technology (Winter G and MIstein C (1991) Nature 349:293-299) provides for a number of 
highly specific binding reagents based on the principles of antibody formation. These 
techniques may be adapted to produce molecules which specifically bind EDG-5s. 

An additional embodiment of the subject invention is the use of HEDG-5 specific 
1 0 antibodies, inhibitors, ligands or their analogs as bioactive agents to treat inflammation or 
disease including, but not limited to viral, bacterial or fungal infections; allergic responses; 
mechanical injury associated with trauma; hereditary diseases; lymphoma or carcinoma; or 
other conditions which activate the genes of kidney, lung, heart, lymphoid or tissues of the 
nervous system. 

15 

Bioactive compositions comprising agonists, antagonists, receptors or inhibitors of 
HEDG-5 may be administered in a suitable therapeutic dose determined by any of several 
methodologies including clinical studies on mammalian species to determine maximal 
tolerable dose and on normal human subjects to determine safe dose. Additionally, the 
20 bioactive agent may be complexed with a variety of well established compounds or 

compositions which enhance stability or pharmacological properties such as half-life. It is 
contemplated that the therapeutic, bioactive composition may be delivered by intravenous 
infusion into the bloodstream or any other effective means which could be used for treating 
problems involving aberrant expression of the Edg-5 gene. 

25 

The examples below are provided to illustrate the subject invention. These examples 
are provided by way of illustration and are not included for the purpose of limiting the 
invention. 

30 EXAMPLES 

Example 1 : PGR cloning of murine edg-5 cDNA 
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Poly-A+ RNA was isolated from TR and TSM murine neuronal cell lines by twice 
selecting on oligo-dT cellulose (Pharmacia, Cat. 27-5649-01). 10.5 jag of this RNA was 
reverse-transcribed with oligo-dT or random hexamers as to prime the RT reaction. RNA and 
primers were heated to 65 °C for 5 min., then cooled to room temperature. Additional 
5 reagents were added to give the following final concentrations: 50 mM Tris-Cl, pH 8.3, 6 mM 
MgCl 2 , 40 mM KC1, 1 mM DTT, 1 mM each dNTP, and 1 unit/jil of Moloney murine virus 
RT enzyme. 

First strand cDNA was amplified in PCR reactions using degenerate primers Al (SEQ 
ID NO: 1) and Bl (SEQ ID NO: 2) was conducted as follows. PCR reactions used 40 ng of 
first strand cDNA in 10 mM Tris-Cl, pH 8.3, 50 mM KC1, 2 \iM of each primer, 1.5 mM 
MgCl 2 , 0.2 |aM each dNTP and 2.5 units of Taq DNA polymerase. Thirty pairwise 
combinations of primers were used. Reactions were placed in a Perkin-Elmer 480 thermal 
cycler, denatured for 3 min. at 94 °C, and then cycled 25-40 times at 96 °C for 45 sec, 47 °C 
for 144 sec or 53 °C for 216 sec, and 72 °C for 3 min. initially, increasing 6 sec/cycle. 
Products were cloned using the TA PCR cloning vector (Invitrogen, Cat. K2000-40). The 
resulting edg-5 clone, 501 (SEQ ED NO: 3) was sequenced by the dideoxy termination 
method. 

20 Al : 5 '-AAYTRS ATIMTISTIAAYYTIGCIGTIGCIGA-3 ' (SEQ ID NO: 1) 

B 1 : 5 '-CTGIYKWTTCATIAWIMMRTAIAYIAYIGGRTT-3 ' (SEQ ID NO: 2) 

The nucleotide sequence of clone 501(SEQ ID NO: 3) is shown in Figure 1A. A 
search of Genbank showed that clone 501 (SEQ ID NO: 3) was most closely related to the 
25 LPA receptor, also identified in Genbank as the GPCR orphan edg-2 (Genbank MMU70622). 
Sequence identity between the clone 501 (SEQ ID NO: 3) and edg-2 was 60.5% over the 639 
bp length of clone 501 (SEQ ID NO: 3). The amino acid sequence of this nucleotide is 
shown in Figure 2 (SEQ ID. NO: 15) 

30 Approximately 5x1 0 5 phage from an embryonic day 15 whole embryo lambda-ZAP 

cDNA library (Clontech) was screened with part of this PCR product, 501AB (the original 
PCR probe from the degenerated PCR screen, using PCR primers A and B), on conventional 
nylon filter lifts with 32 P-labeled probe, and washed to high stringency. Two clones were 

20 
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isolated and subcloned into the EcoRI site of pBluescript. One of the clones was sequenced 
as shown in Figure IB with the amino acid sequence shown in Figure 5B. 



5 EXAMPLE 2: Isolation of hedg-5 cDNA PCR amplification of partial hedg-5 gene 
from human genomic DNA 

PCR primers JC501-F2 (SEQ ID NO: 4) and JC501-R (SEQ ID NO: 5) were 
designed using the sequence of clone 501 (SEQ ID NO: 3) and used to obtain a PCR 
10 fragment of hedg-5, as detailed below with the Expand™ PCR system from Boehringer 
Mannheim (Cat. 1681-842). Human genomic DNA was obtained from Promega (Cat. 
G304A). 

JC501-F2: 

15 5 '-TTTTTACTCGAGATTTGCTGGTTATTGCTGTGG AAAG-3 ' (SEQ ID NO: 4) 



JC501-R: 

5 ^TTTTTTCTAGACGGTCATCACTGTCTTC ATTAGCTTC-3 ' (SEQ ID NO: 5) 



20 Each reaction contained the following reagents: 

30.25 pi water 
10 pi 2.5 mM dNTP mix 
5 pi lOx Expand™ Buffer 3 
25 1.5 pi 10 pMJC501-F2 primer 

1.5 pi 10 pMJC501-R primer 
0.75 pi Expand PCR enzyme (3.5 units/pl) 
1 pi human genomic DNA (0.272 pg/pl) 



30 PCR Conditions: 

Incubate: 94 ° C for 2 min. 

30 cycles: 92 ° C for 1 min. 

45 °C for 5 min. 
68 °C for 1 min. 

21 
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Hold: 4°C 



On ethidium bromide (EtBr)-stained agarose gel, an intense PCR product of about 390 
bp was seen. This product was reamplified in the following PCR reaction: 

5 

30.25 |il water 
10 |il 2.5 mM dNTP mix 
5fil lOx Expand™ Buffer 3 
1.5 |il 10 |iM JC501-F2 primer 
10 1.5 jal 10 |iMJC501-R primer 

0.75 |il Expand PCR enzyme (3.5 units/|il) 

1 \il PCR product from the previous PCR reaction 



PCR Conditions: 
1 5 Incubate: 94 ° C for 2 min. 

30 cycles: 92 ° C for 1 min. 

45°Cfor5min. 

68 °C for 1 min. 
Hold: 4°C 

20 

The intense 390 bp product of the PCR reamplification was excised from the agarose 
gel. The PCR products from 30 |il of the PCR reaction were purified from pooled gel slices 
using a Qiaquick Gel extraction kit (Qiagen Inc.; Cat. 28706) and eluted with 20 |il of 10 mM 
Tris-Cl, pH 8.5. The eluted DNA was quantitated and the sequence of the PCR product was 
25 determined by automated sequencing at Allelix's in-house facility, with an ABI 377 

Sequencer and fluorescent dideoxy terminators, using each primer from the PCR reactions 
shown above. 

Sequencing results showed 81.5% identity at the nucleotide level with the sequence of 
30 mouse clone 501 , over a 3 1 2 bp overlap excluding the primer sequences. 



PCR amplification and sequencing of large edg-5 cDNA fragments 
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Primers, H501-20F (SEQ ID NO: 6) and H501-246R(SEQ ID NO: 7), specific 
to hedg-5, were used to amplify cDNAs encoding larger portions of hedg-5 form a XgtlO fetal 
heart cDNA library, as follows. 

5 H501-20F: 

5 ATGCGGCTGC ATAGC AACCTGACC AAAAAG-3 ' (SEQ ID NO: 6) 
H501-246R: 

5 '-ATCCGC AGGTAC ACC AC AACC ATGATGAGG-3 ' (SEQ ID NO: 7) 



10 



Each reaction contained the following reagents: 



30.25 \xl water 
10^x1 2.5mMdNTPmix 
15 5 \x\ 1 Ox Expand™ Buffer 3 

1.5 jd 10 |iMH501-20F primer 
1.5 (0.1 10 H501-246R primer 
0.75 |il Expand PCR enzyme (3.5 units/^1) 

1 ^1 fetal heart cDNA library (>1 library equivalent/|il; Clontech; Cat. HL5017a) 

20 

PCR Conditions: 

Incubate: 94 ° C for 2 min. 

30 cycles: 92 ° C for 1 min. 

45°Cfor5min. 
25 68°Cfor 1.5 min. 

Incubate: 68 ° C for 8 min. 

Hold: 4°C 

On EtBr-stained agarose gel, a moderately intense 250 bp PCR product was 
30 seen in a fetal heart library, the approximate size expected from the positions of the primers. 
No specific PCR products were seen in any of 13 other cDNA libraries tested. 
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To obtain additional edg-5 sequence, and possibly amplify the full-length cDNA from 
the fetal heart cDNA libraiy, PCR reactions were conducted using JC501-F2 (SEQ ID NO: 4) 
or JC501-R (SEQ ID NO: 5) primers versus primers derived from the lgtlO vector in which 
this cDNA library was constructed. Although cDNA inserts are not directionally cloned into 
5 the lgtlO vector, we chose to amplify products only from one direction. The vector-based 
primer sequences were: 

GT10-F: 5 '-TTTTGAGC AAGTTCAGCCTGGTTAAGT-3 ' (SEQ ID NO: 8) 

GT10-R: 5 ' -TGGCTTATG AGT ATTTCTTCC AGGGT A-3 ' (SEQ ID NO: 9) 

10 

One PCR reaction was done with JC501-F2 (SEQ ID NO: 4) vs. GT10-R (SEQ ID 
NO: 9) primers to amplify the 3' end of edg-5 cDNA clones, and another was done with 
GT10-F (SEQ ID NO: 8) vs. JC501-R (SEQ ID NO: 5) primers to amplify the 5' end of edg-5 
15 cDNA clones. Each 40 jal reaction contained the following reagents: 



20 



23.6 |il water 

8.0^1 2.5mMdNTPmix 

4\i\ 1 Ox Expand™ Buffer 3 

2.0 \x\ 10 edg-5 specific primer 

0 . 8 |al 10 jxM vector primer 

0.6 |Lil Expand PCR enzyme (0.4 units) 

1 \il cDNA library stock (>1 library equivalent/^!; Clontech; Cat. HL5017a) 



25 



PCR Conditions: 



Incubate: 



94°Cfor2min. 



30 cycles: 



92°Cfor30 sec 



55°Cfor2min. 



68°Cfor3min. 



30 



Incubate: 



68°Cfor8min. 



Hold: 



4°C 
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The results showed 2 faint PCR products (designated 510-5-1 and 510-5-2) from the 
3'-end PCR reaction (JC501-F2 (SEQ ID NO:4 /GT10-R (SEQ ID NO:9). From the 5'-end 
PCR reaction (GT10-F (SEQ ID NO: 8)/JC501-R (SEQ ID NO:5) again 2 faint PCR bands 
(designated 510-6-1 and 510-6-2) were seen. Each band was tip-eluted from the gel by 
5 stabbing the band with a fresh Pipetman plugged tip, which was then rinsed into 50 \x\ of TE, 
pH 8. This solution was used as a stock from which nested reamplifications were done, using 
the same vector primer vs. a nested human-specific primer as follows: 



11.5 ^il water 

10 4.0|il 2.5mMdNTPmix 

2jil lOx Expand™ Buffer 1 

0.6 ^il 10 nM edg-5 specific primer 

0.6^1 10 |aM vector primer 

0.3 ^1 Expand PCR enzyme (0.4 units) 

15 1 yd tip-eluted PCR DN A stock 

PCR Conditions: 

Incubate: 94 ° C for 2 min. 

30 cycles: 92 ° C for 30 sec 

20 55°Cfor40sec 
68°Cfor-3min. 
Incubate: 68 ° C for 8 min. 

Hold: 4°C 



25 DNA from the most intense band of each nested reamplification was purified 

using a QIAquick Gel extraction kit and eluted in 50 ul of 10 mM Tris-Cl, pH 8.5. 

Full-length cloning of the hedg-5 cDNA into pcDNA3 vector 

30 Extension PCR (cycles without primers) was used to extend the overlapping ~1 .0 kb 

3' fragment (designated 511-5: reamplified from 510-5-2) and 700 bp 5' fragment (designated 
511-14: reamplified from 510-6-2) as follows: 
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Extension PCR : 

19.8ul water 

5.6 ul 2.5 mM dNTP mix 
4.0 ul lOx Expand™ Buffer 1 
5 5 fal edg-5 3 ' PCR DNA fragment (5 1 1 -5) 

5 ul edg-5 5 ' PCR DNA fragment (511-14) 
0.6 ul Expand PCR enzyme (3.5 units/ul) 

PCR Conditions: 

Incubate 94 ° C for 2 min. 

15 cycles: 92 °C fori min. 

60°Cfor 10 min. 
68°Cfor3.5min. 
Incubate: 68 0 C for 8 min. 

Hold: 4°C 

Two microliters of the extension PCR reaction was then reamplified using the two 
vector primers (GT10-F (SEQ ID NO:8) and GT10-R (SEQ ID NO:9) to select for full-length 
extension products. 

32.25 p.1 water 
7.0 ul 2.5 mM dNTP mix 
5.0 ul lOx Expand™ Buffer 1 
1.5 ul 10 uMGTlO-F primer 
1.5 ul 10 uMGTlO-R primer 
0.75 uIExpand PCR enzyme (3.5 units/ul) 

PCR Conditions: 

Incubate 94 ° C for 2 min. 

30 cycles: 92 ° C for 40 sec 

30 50°Cfor40sec 
68 °C for 3 min. 
Incubate: 68 ° C for 8 min. 
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Hold: 4°C 

After gel electrophoresis of the PCR products, an intense DNA band of about 1.4 kb 
was seen. The PCR product was purified with a QIAquick PCR purification kit (QIAGEN 
5 Inc., Cat. 28106), eluted in 50 ^il of 10 mM Tris-Cl, pH 8.5. The gel-purified PCR fragment 
was then sent for automated sequencing at Allelix's in-house facility, as described above. The 
sequencing results confirmed the identity of the amplified band as edg-5, and suggested that a 
full-length clone of edg-5 had been reconstructed by extension PCR. 

1 o To subclone into pcDNA3 the above DNA was re-amplified with modified vector 

primers GT10-5KXb (SEQ ID NO: 10) and GT10-3BXh (SEQ ID NO: 11). 

GT10-5KXb : 

S'-GGGTAGTCGGTACCTCTAGAGCAAGTTCAGCC- 3' (SEQ ID NO: 10) 
15 GT10-3BXh: 

5'- ATAACAGAGGATCCTCGAGTATTTCTTCCAG- 3' (SEQ ID NO: 1 1) 

Reamplification PCR: 

67.5 ^1 water 
20 14 fil 2.5 mM dNTP mix 

10 jil lOx Expand™ Buffer 1 

3 |il 10 GT10-5KXb primer 

3 |al 1 0 nM GT 1 0-3BXh primer 

1 .5 nl Expand PCR enzyme (3.5 units/pl) 
25 1 fil DNA from previous PCR reaction 

PCR Conditions: 

Incubate 94°Cfor2min. 
5 cycles: 92 ° C for 1 min. 

30 50 °C fori min. 

68°Cfor2min. 
25 cycles: 92 ° C for 1 min. 

60°Cfor 1 min. 
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68°Cfor2min. 
Incubate: 68 ° C for 8 min. 

Hold: 4°C 

5 The PCR product was QIAquick PCR-purified and eluted in 50 jil of 10 mM Tris-Cl, 

pH 8.5 as described previously and restricted with Kpnl and XhoL 

Restriction digest of PGR sample with Kpnl and Xhol: 
Two successive restriction digests was performed on the purified extension PCR product as 
10 follows: 

38 |al Extension PCR DNA 

5 |il 10X NEBuffer 1 (New England Biolabs [NEB]) 

2 |il Kpnl restriction endonuclease (1 0 units; NEB, Cat #142S) 
15 5 \il 1 OX Acetylated BSA stock (NEB) 

The restriction digest was incubated for 1 hour in a 37 ° C water bath, and then the 
following reagents and enzyme were added: 

20 1 0 nl 1 OX NEBuffer 2 (NEB) 

1 ill Xhol restriction endonuclease (20 units; NEB, Cat #146S) 
5 (al 1 OX Acetylated BSA stock (NEB) 
43 |al water 

25 The reaction products were purified using a QIAquick PCR purification kit and eluted in 50 
III of lOmMTris-Cl, pH 8.5. 

Preparation of pcDNA3 cloning vector with Kpnl and Xhol: 

4 (al pcDNA3 plasmid DNA (Invitrogen; Cat. V790-20) containing a 1 .8 kb cDNA 

30 insert 

10 ^ 1 OX NEBuffer 2 (NEB) 

3 |al Kpnl restriction endonuclease (NEB: 1:10 dilution; 3 units) 
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3 |il Xhol restriction endonuclease (NEB: 1 :20 dilution; 3 units) 
10 nl 10X Acetylated BSA stock (NEB) 
64 fj.1 water 

5 The vector DNA was digested for 1 hour at 37 ° C. Then, 3 units more of each enzyme 

was added and the tubes were incubated for a further 2 hr. The digest was run on a gel, and 
the vector DNA band without cDNA insert was excised, purified using GeneClean II kit (BIO 
101) and eluted in 40 ^1 of 10 mM Tris-Cl, pH 8.5. 

10 The double-digested, gel-purified PCR DNA was ligated into the prepared pcDNA3 

plasmid vector using T4 DNA ligase kit (NEB, Cat. 202CS) and transformed into Epicurean 
Coli XL-2 Blue MRF' Ultracompetent cells (Stratagene, Cat. 200150). The transformation 
was plated onto 2xYT/Ampicillin plates and single colonies were picked. DNA minipreps 
were made using QIAGEN QIA-Prep8 miniprep kit (Cat. 27144) and clones with appropriate 

15 inserts were identified by sequencing, canied out with the in-house ABI automated 

sequencing system. From this analysis, a clone designated pC3-hedg-5-3 (SEQ ID NO:13) 
was chosen for complete sequence determination of the cDNA insert. 

Features of the hedg-5 cDNA 

20 

A BLAST search of Genbank, EMBL, dbEST, and the GSS and STS genomic 
sequencing databases indicates that the hedg-5 sequence is novel. The bovine LPA receptor, 
edg-2, was the highest-scoring full-length cDNA sequence found from the combined 
Genbank/EMBL databases (Genbank BTU48236: 55% identity). 

25 

This sequence includes 10 bp of 5' -untranslated sequence, the edg-5 open reading 
frame of 1059 bp, and a 3' -untranslated region spanning 204 bp. The coding region of edg-5 
begins with the first methionine codon, at nt 36-38 and terminates with the stop codon at nt 
1095-1097. The prediction of this open reading frame is supported by the sequence of 
30 genomic DNA flanking the 5' end of the cDNA sequence (see below). 250 bp of 5' flanking 
sequence was obtained from a BAC genomic clone as described in Example 16 (Figure 4A, 
SEQ ID NO: 12). The proposed translation start site was preceded by an in-frame stop 
codon 24 bp upstream. Sequencing of different clones revealed the existence of several 
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sequence polymorphisms, which may represent a sampling of natural variability of the edg-5 
sequence within the human population. The 15 polymorphisms observed within the edg-5 
open reading frame are listed below. Nine of these substitutions did not result in a change in 
the encoded amino acid), while 3 resulted in conservative substitutions and 3 resulted in 
5 nonconservative substitutions. 



Table 1. Apparent polymorphisms in the hedg-5 protein coding region. 





Nucleotide 


Affected Codon 


Amino Acid 


Consequence 


10 


Position 


& Polymorphism 


Predicted 






491 


TTC 


Phenylalanine 








TTT 


Phenylalanine 


Silent 




585 


CTG 


Leucine 




15 




TTG 


Leucine 


Silent 




716 


GTC 


Valine 








GTT 


Valine 


Silent 




779 


ATC 


Isoleucine 








ATT 


Isoleucine 


Silent 


20 


781 


TCT 


Serine 


Nonconservative 






TTT 


Phenylalanine 


Substitution 




788 


TGC 


Cysteine 








TGT 


Cysteine 


Silent 




790 


TCT 


Serine 


Nnn con servati ve 


25 




TTT 


Phenylalanine 


Substitution 




830 


TTC 


Phenylalanine 








TTT 


Phenylalanine 


Silent 




874 


GTG 


Valine 


Conservative 






GCG 


Alanine 


Substitution 


30 


887 


ATC 


Isoleucine 








ATT 


Isoleucine 


Silent 




914 


AAC 


Asparagine 


Conservative 






AAA 


Lysine 


Substitution 




917 


GTC 


Valine 




35 




GTT 


Valine 


Silent 
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922 



1041 



1277 



TCT 

TTT 

CTC 

TTC 

GAG 

GAA 



Serine 

Phenylalanine 

Leucine 

Phenylalanine 

Glutamate 

Glutamate 



Nonconservative 
Substitution 
Conservative 
Substitution 

Silent 



The edg-5 open reading frame of the P C3-hEdg5-3 (SEQ ID NO:13; Figure 3A) clone 
10 predicts a 353 amino acid polypeptide (SEQ ID NO: 1 4, Figure 4A) with many typical 
features of a GPCR. These include: 

1 . A hydropathy profile consistent with the 7 transmembrane structure of GPCRs: 

• N-terminal extracellular domain: 1-30 
15 • TM-1: 31-56 

• IL- 1:57-63 

• TM-2: 64-92 

• EL-1: 93-106 

• TM-3: 107-125 
20 • IL-2: 126-144 

• TM-4: 145-170 

• EL-2: 171-186 

• TM-5: 187-207 

• IL-3: 208-239 
25 • TM-6: 240-261 

• EL-3: 262-276 

• TM-7: 277-297 

• C-terminal cytoplasmic domain: 298-353 

2. Potential N-glycosylation site in the extracellular N-terminal domain, at residue 1 5 
30 3. Potential protein kinase C phosphorylation sites at residues 141, 229 and 303 

4. Potential cAMP- and cGMP-dependent kinase phosphorylation sites at residues 217,233 
and 321 

5. Potential casein kinase-H phosphorylation site at residue 329 
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The amino acid sequence of the human edg-5 receptor, SEQ ID NO: 14 (Figure 4A), also 
shows high homology with other members of the edg subfamily of GPCRs. . The pairwise 
percent identity and similarity is presented in Table 2 below: 
5 Table 2. 

Percent Amino Acid Identity and Similarity of Edg Family Sequences to the Human Edg-5 
receptor 



Gene 


Percent Identity 


Percent Similarity 


Edg-1 (Human) 


30.1 


40.9 


Edg-2 (Human) 


48.6 


59.0 


Edg-2 (Bovine) 


55.1 




Edg-3 (Human) 


32.6 


43.3 


H218(Edg-4-Rat) 


31.6 


40.6 


Edg-6 (Human) 


46.0 


55.5 



10 Multiple sequence alignment indicates that edg-2 is the closest known relative of edg-5 at the 
amino acid sequence level, as suggested by the DNA sequence. The edg-5 gene product is 
also closely related to edg-6, a novel edg gene described in copending application USSN 
08/763,938. Edg-2, edg-5 and edg-6 appear to form a subfamily distinct from edg-1, edg-3 
and edg-4 within the larger edg gene family. 

15 

Alternative splicing variants of murine edg-2 have been found, which differ in length 
within the N-terminal coding region. The longer open reading frame (Genbank, accession 
no:MMU70622) encodes an 18-amino acid N-terminal extension of the shorter open reading 
frame (Genbank, accession no:MMU48235), and retains the initiator methionine codon of the 

20 shorter product as amino acid 19 of the longer product. Due to the sequence relatedness of 
edg-2 and edg-5 and the fact that the methionine codon of the shorter edg-2 product aligns 
closely with the initiation codon of hedg-5, the edg-5 open reading frame hedg-5 may encode 
a similar N-terminal extension to the HEDG-5 peptide of SEQ ID NO: 14. Such an extension 
will result from splicing of sequences found upstream of the hedg-5 sequences presented 

25 herein, and will produce one or more spliced mRNA variants with a N-terminal extensions. 
Briefly, given the instant disclosure the skilled artisan could discover such splice variants by 
5' RACE using a commercially available 5' RACE kit (Life Technologies, Cat No:18374- 
041) using the approach detailed in start protocols in Molecular Biology (2nd edition, 15-27). 
Briefly, first strand cDNA is primed using an antisense oligonucleotide specific for hedg-5 
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and ideally directed to a sequence about 500 nucleotides from the 5' end of the known hedg-5 
sequence; kidney and lung RNA are preferred templates for cDNA synthesis. Thereafter, first 
strand cDNA is then tailed using terminal transferase, for example, with deoxyguanine 
residues. PCR amplification is primed using an anchor primer complementary to the 
5 polyguanine tail and a nested primer specific to hedg-5. 

EXAMPLE 3: Molecular cloning of hedgS coding region for expression and functional 
analysis in eukaryotic cells. 

10 After surveying various cDNA libraries and first strand cDNA preparations, we were 

unable to obtain a full-length clone. The rarity of edg-5 in cDNA libraries is further supported 
by a complete absence of EST's from the edg-5 coding regions in the DBEST database, 
which contains millionsof individual EST's. Therefore, an alternative approach was designed. 
In this approach, the coding region would be amplified in two fragments from genomic DNA, 

15 since we previously determined the location of the single splice site that occurs (between nt 
771/772 of SEQUENCE ID NO: 13) in the genomic DNA encoding HEDG5. Then, the two 
fragments would be joined by an extension PCR in which primers were engineered to contain 
a 30 bp overlap.between the two fragments to obtain a functional, full-length edg5 cDNA, 
DNA fragments from two exons next to intron located at nt 996/997 were PCR amplified 

20 using the following primers so that they have an overlap of 30 nt. 

5' Exon Fragment 

HE5-261F: [5 '-ATGAATGAGTGTCACTATGACAAG-3 '] 
25 HE5-101 1R: [5 , -ATACCACAAACGCCCCTAAGACAGTCATCACCGTCTTC-3 , ] 

3' Exon Fragment 

HE5-982F: [5 ' -TG ATGACTGTCTT AGGGGCGTTTGTGGT ATGCTGG ACC-3 ' ] 
30 HE5-1322R: [5 ' -TTAGGAAGTGCTTTTATTGC AGACTGC-3 *] 
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Human genomic DNA (Clontech, Cat #6550-1) was amplified with each pair of 
primers under the following condition of PCR amplification by using Expand™ PCR system 
from Boehringer-Mannheim (Cat. 168 1-842). 

5 Each reaction contained the following reagents: 

5.0 ul lOx PCR Buffer 3 
1.0 ul 25 mM dNTP mix 

1 .5 ul Primer HE5-261F or HE5-982F (10 pmol/1) 
10 1.5 ul Primer HE5-1011R or HE5-1322R (10 pmol/1) 

0.75 ul Expand™ Enzyme (7.5 units) 
38.25 ul water 
2.0 ul DNA 

15 PCR conditions: 

Incubate: 94°C for 2 min 

30 cycles: 94°C for 1 min 

55°Cfor2min 
68°C for 1 min 
20 Incubate: 68°C for 8 min 

Hold: 4°C 

DNA fragments of approximately 700 bp (5' exon) and 350 bp (3' exon) were 
amplified. The two DNA fragments were purified using Qiaquick gel extraction kit (Qiagen, 
25 Cat. 28706 ) and eluted in 50 ul of 10 mM Tris, pH 8.5. Extension PCR (cycles without 
primers) was then used to join the 5' exon and 3' exon fragments, which overlapped each 
other by 30 bp. 



30 



Extension PCR: 

Each reaction contained the following reagents: 
2.0 pi 1 Ox PCR Buffer 3 
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0.4 pi 25 mM dNTP mix 

0.3 pi Expand™ Enzyme (2.5 units) 

13.8 pi water 

1 .0 pi 5' exon PCR-amplified DNA 
5 1.0 ul 3' exon PCR-amplified DNA 



PCR conditions: 

Incubate: 94°C for 2 min 

20 cycles: 94°C for 1 min 

10 60°Cfor5min 

68°C for 1.5 min 

Incubate: 68°C for 8 min 

Hold: 4°C 

1 5 Five pi of the amplified product from the above PCR was then reamplified under the 

following condition of PCR with primers HE5-261F and HE5-1322R, described previously. 



Each reaction contained the following reagents: 



20 5.0 ul lOx PCR Buffer 3 

1.0 ul 25 mM dNTP mix 
1 .5 pi Primer HE5-261F (10 pmol/pl ) 
1.5 pi Primer HE5-1322R (10 pmol/pl ) 
0.75 pi Expand™ Enzyme (7.5 units) 

25 35.25 pi water 

5.0 pi DNA 



PCR conditions: 
Incubate: 
25 cycles: 

Incubate: 



94°C for 2 min 
94°C for 1 min 
55°C for 1 min 
68°C for 1 min 
68°C for 8 min 
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Hold: 



An intense DNA band of about 1 .0 kb was purified using the Qiaquick PCR 
purification kit (Qiagen), eluted in 50 jil of 10 mM Tris, pH 8.5 and was sent for direct PCR 
5 sequencing with each primer used in the above PCR reactions, as described previously. The 
resulting sequences showed 93 - 99% identity to human edg5 cDNA, within the edg-5 coding 
region. 

To subclone into pcDNA3.1 (Invitrogen; Cat. V795-20) the above DNA was reamplified with 
1 0 modified primers HE5-KZKF and HE5-Kpnl 322R under the following conditions: 



HE5-KZKF: [5'-TTTAAACTCGAGCCACCATGAATGAGTGTCACTATGAC - 3'] 
HE5-Kpnl322R: [5'-TATATAGGTACCTTAGGAAGTGCTTTTATTGCAGACTGC- 
3'] 



15 



Each reaction contained the following reagents: 



20 



5.0 \i\ lOx PCR Buffer 3 

1.0 yd 25mMdNTPmix 

L5 ^1 Primer HE5-KZKF (10 pmol/^1 ) 

1 .5 \il Primer HE5-Kpnl 322R (1 0 pmoV^il ) 

0.75 \xl Expand™ Enzyme (7.5 units) 

39.25 |xl water 

1.0 nl DNA 



25 



PCR conditions: 



Incubate: 



94°C for 2 min 



25 cycles: 



94°C for 40 sec 



30 



50°C for 1 min 



68°Cfor 1.5 min 



25 cycles: 



94°C for 40 sec 
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Incubate: 68°C for 8 min 

5 Hold: 4°C 

The PCR product was purified as described previously and subcloned into Kpnl and 
Xho 1 restriction sites of pcDNA3 . 1 . 

10 Plasmid DNA was prepared from several positive clones and cotransfected into 293- 

EBNA cells together with the 2xSRE-Luciferase reporter plasmid. 

Transient transfection protocol for 293-EBNA: 

15 Day 1. 

1) 100 mm plates of 293-EBNA with a confluency of -80% were used for transfection. 

2) NF-kB Reporter Gene Cotransfection: Expression plasmid (3.5 jag) and reporter plasmid 
(6XNF-kBtk-p4Luc-zeo; 0.5 ^ig) DNA samples were combined and diluted in 750 jil of 
DMEM/F12 (serum-free media) and 20 Plus Reagent (Lipofectamine Plus Kit, Life 

20 Technologies Cat. 10964-013), and incubated at room temperature for 15 min. 

3) 30 }i Lipofectamine Reagent (Lipofectamine Plus Kit) was diluted in 750 nl DMEM/F12. 
The diluted Lipofectamine was then combined with the DNA/Plus mixture and incubated at 
RT for 15 min. 

4) The 293-EBNA plates were washed once with PBS, and 5 ml DMEM/F12 was added to 
25 each plate. 

5) DNA/Plus/Lipofectamine mixture was added to each plate of 293-EBNA cells. The plates 
were left for 3 hr at 37°C in a 5% C0 2 incubator. 

6) The transfection medium was replaced with DMEM/F12 containing 10% FBS to recover 
overnight. 

30 

Day 2. 

1) Transfected cells were harvested by trypsinization and 20,000 cells per well were plated in 
96-well Blackview plates coated with poly D-lysine (Becton Dickinson Labware, Cat. 
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40640). Medium was DMEM/F12 containing 0.5% FBS. No cells were plated in the outside 
wells of the 96-well plate. Cells were returned to the incubator for 24 hr. 
Day 3. 

1) Media was removed and cells treated with compounds diluted in DMEM/F12 media 

5 containing 0.15% FBS and the following treatments: a) Untreated: DMEM/F12 plus 0.15% 
FBS; b) AN (10 jiM anandamide); c) LPA (10 ^M oleyl lysophosphatidic acid). 

2) The cells were treated overnight in the incubator. 

Day 4. 

10 1) Luclite kit (Packard; Cat. 601691 1) was used for luciferase assay. All reagents were 
brought to room temperature before use. 

2) Media was removed from each well. 50 jil 0.5M HEPES pH 7.8, 1 mM MgCl 2 , 1 mM 
CaCl 2 was added to all wells of 96-well plate. 

3) Luclite substrate was made up and 50 jil substrate was added to each well as specified in 
15 the kit. 

4) Plates were incubated at room temperature for 30 min. 

5) After incubation, plates were counted in a 12-detector Packard Top Count on a program 
without dark delay. 

20 Results: 

As we have documented elsewhere (See U.S. provisional patent application entitled 
"Identification of Lysolipid Receptors Involved in Inflammatory Response" filed on 
November 25, 1998 by MUNROE and corresponding PCT application filed on December 30, 
25 1998), edg-2, edg-5 and edg-6 proved to be inflammatory LPA receptor subtypes of the edg 
receptor family which when activated induce NF-kB. As exemplified in Figure 6, it was 
determined that HEDG-5, as particularly represented by the two clones pc3-hedg5#3-4 and 
pc3-hedg5#28, responded to LPA but not anandamide at 10 ^M to activate NF-kB. (See 
Figure 6) 

30 



38 

SDOCID: <WO 9933972A1_I_> 



WO 99/33972 PCT/CA98/01193 
Three inflammatory subtypes of lysophosphatidic acid (LP A) receptor. 



An additional experiment was carried out to test the response of clone #28 in reporter 
gene constructs with the serum response element (SRE) or the proximal 1 kb of the human 
5 collagenase gene promoter containing an inducible enhancer element for activator protein- 1 
(AP-1) together with the edg-2 and edg-6 receptor sub-types. As shown in Figure 7, the pC3- 
hedg5#28 showed an SRE response and an AP-1 response when treated with 10 \iM LP A. 

To determine whether these receptors might mediate inflammatory responses, each 
10 was cotransfected separately with SRE, NF-kB or AP-1 reporter genes. The AP-1 reporter 
contained approximately 1 kb of the human collagenase II promoter, and the first 50 bp of the 
5 '-untranslated region of the collagenase II transcription unit (Angel P, et al. Phorbol ester- 
inducible genes contain a common cis element recognized by a TPA-modulated trans-acting 
factor. Cell. 1987 Jun 19;49(6):729-739.)> a region whose inducible expression has been 
1 5 shown to be controlled by AP-1 . This transcription factor, like NF-kB has been implicated in 
inflammatory and neoplastic signal transduction., though the gene targets of its action are 
largely distinct from those of NF-kB ( Adcock IM. Transcription factors as activators of gene 
transcription: AP-1 and NF-B. Monaldi Arch Chest Dis. 1997 Apr;52(2): 178-86. Review.). 

20 293-EBNA cells were grown, lipofected in monolayer cultures, and pretreated as 

described above for Example 11, assay #1, except that NF-kB and AP-1 reporter-transfected 
cells were pretreated for 6 hr in medium containing 0.5% FBS, then treated overnight in the 
same medium with or without 10|iM LP A. 

25 Results: As shown in Fig. 7, all three receptors robustly activated the NF-kB reporter (about 
3-4-fold) in the presence of 10 pM LP A, while no response to LP A was seen when the NF- 
kB reporter was cotransfected with the empty expression vector pcDNA3. With the SRE and 
AP-1 reporter genes, some endogenous response to LPA was seen (about 1.5-fold vs 
untreated control cells). However, edg-6 strongly induced both reporters, while edg-2 and 

30 edg-5 caused greater than 2-fold induction of the SRE and AP-1 reporters with LPA. 
Therefore, all three LPA receptors tested here are capable of inducing inflammatory gene 
transcription through NF-kB, and perhaps, AP-1 as well. 
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EXAMPLE 4: Detection of hedg-5 polynucleotides by hybridization with hedg-5. 



10 



Edg polynucleotides can vary through the introduction of natural or artificial 
mutations or through cloning and subsequent manipulations. Moreover, the mammalian 
homolog of a given gene usually varies by 10-30% from species to species, as a result of 
nucleotide changes that have accumulated through their divergent evolutionary history. 
Therefore, a method is provided herein for the detection and identification of hedg variants 
and other highly related genes. 



The HEDG-5 coding region of hedg-5 is prepared by restriction of either pC3-hEdg5- 
3 or pC3-hedg5#3.4 or pC3-hedg5#28 with appropriate restriction enzymes to release the 
full-length hedg-5 insert, followed by cDNA insert purification using standard techniques 
after agarose gel electrophoresis. The cDNA insert may be labeled using 32 P-nucleotide end- 
1 5 labeling or random priming (several kits are commercially available), or through 

incorporation of non-natural nucleotides for later detection with antibodies by methods well 
known in the art. Nylon filters (e.g. Hybond N+, Cat. RPN132B) bearing a polynucleotide or 
mixture of polynucleotides are prepared by standard techniques. Examples include Southern 
blots, filter lifts from bacterial colonies or bacteriophage plaques and the like. 

20 

The dried filters are rehydrated in water, then prehybridized in a sealable bag with 10 
ml (or enough to cover filters and seal the bag) of hybridization solution (48% deionized 
formamide, 4.8x SSC [20x SSC is 3 M NaCl, 0.3 M sodium citrate, pH 7.0], lx Denhardt's 
solution [50x Denhardt's is 1% Ficoll 400, 1% polyvinylpyrrolidone, 1% BSA (Pentax 
25 Fraction V)], 10% dextran sulfate, 0.1% sodium dodecyl sulfate [SDS]) for 1 hr or more at 
42°C. 



Radiolabeled probe is added to 1 ml of sonicated herring sperm DNA (2 mg/ml) in a 
screw-cap tube and incubated in a boiling water bath for 10 min. Transfer the tube to ice, add 
30 2 ml of hybridization solution and inject the probe solution into the sealed bag. Sufficient 
probe should be added to give 1 to 15 ng of radiolabeled probe/ml hybridization buffer (final 
volume) at >5xl0 7 cpm/jag DNA. Reseal the bag, mix thoroughly and incubate overnight at 
42°C in a shaking or rotating water bath or incubator. 
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Wash filters three times with 500 ml of low-stringency wash buffer (2x SSC, 0.1% 
SDS) at RT for 15 min per wash, on a slowly rotating platform. Then wash two times with 
medium-stringency wash buffer (lx SSC, 0.1% SDS) at 65°C 15 min per wash. Dry the 
5 filters and expose to Phosphorimager cassette or autoradiography film. Positive spots or DNA 
bands are identified after subtraction of background or appropriate negative control samples 
(see below). 

If needed, a DNA spot containing 10 pmol of the full-length hedg insert of pC3- 
1 0 hEdg5-3 can be used as a positive control (Pos) on the filter, and a DNA spot containing 10 
pmol of full-length human edg-2 insert (edg-2 open reading frame only) can be used as a 
negative control (Neg). A full-length open reading frame, or a partial-length open reading 
frame, of a test DNA (also 10 pmol) will be scored as a positive if the integrated optical 
density (IOD) of the radioactive probe hybridizing to the test DNA (Test) is greater than 
1 5 IOD Neg + (IOD Pos - IOD Neg )/2. Otherwise, the test DNA will be scored as negative. A positive 
test correlates with approximately at least 70 % identitiy, and more preferably at least 80-85 
sequence identity. If a partial-length open reading frame of the test gene is used, then the 
equivalent regions of edg-5 and edg-2 will be used as positive and negative controls, 
respectively, for hybridization. 

20 

EXAMPLE 5: Antisense analysis 

Knowledge of the correct, complete cDNA sequence of HEDG-5 enables its use as a 
25 tool for antisense technology in the investigation of gene function. Oligonucleotides, cDNA 
or genomic fragments comprising the antisense strand of hedg-5 are used either in vitro or in 
vivo to inhibit expression of the mRNA. Such technology is now well known in the art, and 
antisense molecules can be designed at various locations along the nucleotide sequences. By 
treatment of cells or whole test animals with such antisense sequences, the gene of interest is 
30 effectively turned off Frequently, the function of the gene is ascertained by observing 
behavior at the intracellular, cellular, tissue or organismal level (e.g., lethality, loss of 
differentiated function, changes in morphology, etc.). 



41 



ISDOCID: <WO 9933972A1_I_> 



PCT/CA98/0U93 

WO 99/33972 

In addition to using sequences constructed to interrupt transcription of a particular 
open reading frame, modifications of gene expression is obtained by designing antisense 
sequences to intron regions, promoter/enhancer elements, or even to trans-acting regulatory 
genes. Similarly, inhibition is achieved using Hogeboom base-pairing methodology, also 
5 known as "triple helix" base pairing. 

EXAMPLE 6: Expression of HEDG-5 

Expression of hedg-5 is accomplished by subcloning the cDNAs into appropriate 
10 expression vectors and transfecting the vectors into analogous expression hosts for example 
E.Coli. In a particular case, the vector is engineered such that it contains a promoter for p- 
galactosidase, upstream of the cloning site, followed by sequence containing the amino- 
terminal Met and the subsequent 7 residues of |5-galactosidase. Immediately following these 
eight residues is an engineered bacteriophage promoter useful for artificial priming and 
1 5 transcription and for providing a number of unique endonuclease restriction sites for cloning. 

Induction of the isolated, transfected bacterial strain with IPTG using standard 
methods produces a fusion protein corresponding to the first seven residues of P- 
galactosidase, about 15 residues of "linker", and the peptide encoded within the cDNA. Since 

20 cDNA clone inserts are generated by an essentially random process, there is one chance in 
three that the included cDNA will lie in the correct frame for proper translation. If the cDNA 
is not in the proper reading frame, it is obtained by deletion or insertion of the appropriate 
number of bases using well known methods including in vitro mutagenesis, digestion with 
exonuclease in or mung bean nuclease, or the inclusion of an oligonucleotide linker of 

25 appropriate length. 

The hedg-5 cDNA is shuttled into other vectors known to be useful for expression of 
protein in specific hosts. Oligonucleotide primers containing cloning sites as well as a 
segment of DNA (about 25 bases) sufficient to hybridize to stretches at both ends of the target 
30 cDNA is synthesized chemically by standard methods. These primers are then used to 
amplify the desired gene segment by PCR. The resulting gene segment is digested with 
appropriate restriction enzymes under standard conditions and isolated by gel electrophoresis. 
Alternately, similar gene segments are produced by digestion of the cDNA with appropriate 
restriction enzymes. Using appropriate primers, segments of coding sequence from more 
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man one gene are ligated <oge«her and cloned in appropnale vectors. 1. is possible to 
optimize expression by construction of such chimeric sequences. 

Snitab.e expression hosts for such chimeric molecu.es include, but are no, limited .0, 
; mammalian ce,U auch as Chinese Hamster Ovary (CHO) and human 293 cells, insect ce,, 
such as SI9 cens, yeas, ceUs such as Saccharomyces cerevisiae, and bactena such as E. cob. 
For each of these cell systems, a useful expression vector also inchtdes an ongm of 
replication to allow propagation in bacteria and a selectable marker such as the p-iactamase 
antibiotic resistance gene ,0 allow plasmid selection m bacteria, m addition, me vector may 
„ include a second selectable marker such as the neomycin phosphotransferase gene to allow 
action in transfer eukaryodc host cens. Vectors for use in eukaryotic 
require RNA processing elements such as J- polyadeny.afion sequences if snob are not par, 
the cDNA of interest. 

Addi.iona.ly, the vector curtains promos or enhancers which increase gene 
expression. Such promoters are hos, specific and mc.ude MMTV, SV<0, and met—ne 
plo.ers for CHO cens; up. lac, tac and T7 promoters for bacteria, bos*; and a,ph^, 
lobo. oxidase and PGH promote, for yeas, Transcription enhancers, such as k - 
^coma vims enhancer, are used in mammal hos. ceUs. Once homogeneous cuUures of 
20 mcombinan.cellaareobamedmroughs.andamemmrememods.largeqnan.tttesof 

recombinanfly produce* HEDG-5 are recovered from me condifioned meuium and analy^d 

Coned in* fire expression veo.or pcDNA3, as exempfifted herein. This prodne. can be «d 
to transform, for example, HEK293 or COS by mefi.odo.ogy sfcndard in fire art. Spectftcally, 
25 for example, using Lipofeotomine (Gibco BRL ca*log no. 18324-020) media.*, gene 
transfer. 

EXAMPLE 7: Isolation of Recombinant HEDG-5 

HEDG-5 is expressed as a ehimerie protein with one or more additional polypeptide 
domainsaddedtofaeilitateproteinpurifieation. Sueh purifieation facilitating domams 
include, but arenot limited to, metal chelatmg peptides such as histidine-tryptophan modules 
that allow purification on immobilized metals, protein A domains that allow purification on 
mobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 



30 
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purification system (Immunex Corp., Seattle WA). The inclusion of a cleavable linker 
sequence such as Factor XA or enterokinase (Invitrogen) between the purification domain 
and the HEDG-5 sequence is useful to facilitate expression of HEDG-5. 

5 EXAMPLE 8: Testing of Chimeric T7Gs 

Functional chimeric T7Gs are constructed by combining the extracellular and/or 
transmembrane ligand-receptive sequences of a new isoform with the transmembrane and/or 
intracellular segments of a different T7G for test purposes. This concept was demonstrated 

10 by Kobilka et al (1988, Science 240:1310-1316) who created a series of chimeric a2-p2 
adrenergic receptors (AR) by inserting progressively greater amounts of a2-AR 
transmembrane sequence into p2-AR. The binding activity of known agonists changed as the 
molecule shifted from having more cc2 than p2 conformation, and intermediate constructs 
demonstrated mixed specificity. The specificity for binding antagonists, however, correlated 

1 5 with the source of the domain VII. The importance of T7G domain VII for ligand recognition 
was also found in chimeras utilizing two yeast a-factor receptors and is significant because 
the yeast receptors are classified as miscellaneous receptors. Thus, functional role of specific 
domains appears to be preserved throughout the T7G family regardless of category. 

20 In parallel fashion, internal segments or cytoplasmic domains from a particular 

isoform are exchanged with the analogous domains of a known T7G and used to identify the 
structural determinants responsible for coupling the receptors to trimeric G-proteins 
(Dohlman et al (1991) Annu Rev Biochem 60:653-88). A chimeric receptor in which 
domains V, VI, and the intracellular connecting loop from P2-AR were substituted into a2- 

25 AR was shown to bind ligands with a2-AR specificity, but to stimulate adenylate cyclase in 
the manner of P2-AR. This demonstrates that for adrenergic-type receptors, G-protein 
recognition is present in domains V and VI and their connecting loop. The opposite situation 
was predicted and observed for a chimera in which the V- > VI loop from al-AR replaced the 
corresponding domain on P2-AR and the resulting receptor bound ligands with p2-AR 

30 specificity and activated G-protein-mediated phosphatidylinositol turnover in the al-AR 

manner. Finally, chimeras constructed from muscarinic receptors also demonstrated that V- > 
VI loop is the major determinant for specificity of G-protein activity (Bolander FF, supra). 
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Chimeric or modified T7Gs containing substitutions in the extracellular and 
transmembrane regions have shown that these portions of the receptor determine ligand 
binding specificity. For example, two Ser residues conserved in domain V of all adrenergic 
and D catecholamine T7G receptors are necessary for potent agonist activity. These serines 
are believed to form hydrogen bonds with the catechol moiety of the agonists within the T7G 
binding site. Similarly, an Asp residue present in domain III of all T7Gs which bind biogenic 
amines is believed to form an ion pair with the ligand amine group in the T7G binding site. 

Functional, cloned T7Gs are expressed in heterologous expression systems and their 
biological activity assessed (e.g. Marullo et al (1988) Proc Natl Acad Sci 85:7551-55; King et 
al (1990) Science 250:121-23). One heterologous system introduces genes for a mammalian 
T7G and a mammalian G-protein into yeast cells. The T7G is shown to have appropriate 
ligand specificity and affinity and trigger appropriate biological activation-growth arrest and 
morphological changes-of the yeast cells. 



An alternate procedure for testing chimeric receptors is based on the procedure 
utilizing the P 2n purinergic receptor (PJ as published by Erb et al (1993, Proc Natl Acad Sci 
90:10441 1-53). Function is easily tested in cultured K562 human leukemia cells because 
these cells lack P 2u receptors. K562 cells are transfected with expression vectors containing 
20 either normal or chimeric P 2u and loaded with fura-a, fluorescent probe for Ca++. Activation 
of properly assembled and functional P 2u receptors with extracellular UTP or ATP mobilizes 
intracellular Ca++ which reacts with fura-a and is measured spectrofluorometrically. As 
with the T7G receptors above, chimeric genes are created by combining sequences for 
extracellular receptive segments of any newly discovered T7G polypeptide with the 
25 nucleotides for the transmembrane and intracellular segments of the known P 2u molecule. 
Bathing the transfected K562 cells in microwells containing appropriate ligands triggers 
binding and fluorescent activity defining effectors of the T7G molecule. Once ligand and 
function are established, the P 2u system is useful for defining antagonists or inhibitors which 
block binding and prevent such fluorescent reactions. 



30 



EXAMPLE 9: Production of HEDG-5 Specific Antibodies 

Two approaches are utilized to raise antibodies to HEDG-5, and each approach is 
35 useful for generating either polyclonal or monoclonal antibodies. In one approach, denatured 
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protein from reverse phase HPLC separation is obtained in quantities up to 75 mg. This 
denatured protein is used to immunize mice or rabbits using standard protocols; about 100 
micrograms are adequate for immunization of a mouse, while up to 1 mg might be used to 
immunize a rabbit. For identifying mouse hybridomas, the denatured protein is 
5 radioiodinated and used to screen potential murine B-cell hybridomas for those which 

produce antibody. This procedure requires only small quantities of protein, such that 20 mg 
is sufficient for labeling and screening of several thousand clones. 

In the second approach, the amino acid sequence of an appropriate HEDG-5 domain, 
10 as deduced from translation of the cDNA, is analyzed to determine regions of high 

antigenicity. Oligopeptides comprising appropriate hydrophilic regions are synthesized and 
used in suitable immunization protocols to raise antibodies. Analysis to select appropriate 
epitopes is described by Ausubel FM et al (supra). The optimal amino acid sequences for 
immunization are usually at the C-terminus, the N-terminus and those intervening, 
15 hydrophilic regions of the polypeptide which are likely to be exposed to the external 
environment when the protein is in its natural conformation. 

Typically, selected peptides, about 15 residues in length, are synthesized using an 
Applied Biosystems Peptide Synthesizer Model 431 A using frnoc-chemistry and coupled to 

20 keyhole limpet hemocyanin (KLH; Sigma, St. Louis MO) by reaction with M-maleimidoben- 
zoyl-N-hydroxysuccinimide ester (MBS; Ausubel FM et al, supra). If necessary, a cysteine is 
introduced at the N-terminus of the peptide to permit coupling to KLH. Rabbits are 
immunized with the peptide-KLH complex in complete Freund's adjuvant. The resulting 
antisera are tested for antipeptide activity by binding the peptide to plastic, blocking with 1% 

25 bovine sewm albumin, reacting with antisera, washing and reacting with labeled (radioactive 
or fluorescent), affinity purified, specific goat anti-rabbit lgG. 

Hybridomas are prepared and screened using standard techniques. Hybridomas of 
interest are detected by screening with labeled HEDG-5 to identify those fusions producing 
30 the monoclonal antibody with the desired specificity. In a typical protocol, wells of plates 
(FAST; Becton-Dickinson, Palo Alto CA) are coated during incubation with affinity purified, 
specific rabbit anti-mouse (or suitable antispecies lg) antibodies at 10 mg/ml. The coated 
wells are blocked with 1% BSA, washed and incubated with supematants from hybridomas. 
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After washing the wells are incubated with labeled HEDG-5 at 1 mg/ml. Supernatants with 
specific antibodies bind more labeled HEDG-5 than is detectable in the background. Then 
clones producing specific antibodies are expanded and subjected to two cycles of cloning at 
limiting dilution. Cloned hybridomas are injected into pristane-treated mice to produce 

5 ascites, and monoclonal antibody is purified from mouse ascetic fluid by affinity 

chromatography on Protein A. Monoclonal antibodies with affinities of at least 10 8 M-\ 
preferably 10 9 to 10 10 or stronger, are typically made by standard procedures as described in 
Harlow and Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY; and in Goding (1986) Monoclonal Antibodies: Principles and 

1 0 Practice, Academic Press, New York City, both incorporated herein by reference. 

EXAMPLE 10: Diagnostic Test Using HEDG-5 Specific Antibodies 

Particular HEDG-5 antibodies are useful for investigating signal transduction and the 
1 5 diagnosis of infectious or hereditary conditions which are characterized by differences in the 
amount or distribution of HEDG-5 or downstream products of an active signaling cascade. 

Diagnostic tests for HEDG-5 include methods utilizing antibody and a label to detect 
HEDG-5 in human body fluids, membranes, cells, tissues or extracts of such. The 

20 polypeptides and antibodies of the present invention are used with or without modification. 
Frequently, the polypeptides and antibodies are labeled by joining them, either covalently or 
noncovalently, with a substance which provides for a detectable signal. A wide variety of 
labels and conjugation techniques are known and have been reported extensively in both the 
scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, 

25 cofactors, inhibitors, fluorescent agents, chemiluminescent agents, chromogenic agents, 
magnetic particles and the like. Patents teaching the use of such labels include US Patent 
No's. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. 
Also, recombinant immunoglobulins may be produced as shown in US Patent No.4,816,567, 
Incorporated herein by reference. 



30 



A variety of protocols for measuring soluble or membrane-bound HEDG-5, usmg 
either polyclonal or monoclonal antibodies specific for the protein, are known in the art. 
Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA) 
and fluorescent activated cell sorting (FACS). A two-site monoclonal-based immunoassay 
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utilizing monoclonal antibodies reactive to two non-interfering epitopes on HEDG-5 is 
preferred, but a competitive binding assay may be employed. These assays are described, 
among other places, in Maddox, DE et al (1 983, J Exp. Med. 158:121 If). 

5 EXAMPLE 1 1 : Purification of Native HEDG-5 Using Specific Antibodies 

Native or recombinant HEDG-5 is purified by immunoaffmity chromatography using 
antibodies specific for HEDG-5. In general, an immunoaffinity column is constructed by 
covalently coupling the anti-TRH antibody to an activated chromatographic resin. 

10 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation 
with ammonium sulfate or by purification on immobilized Protein A (Pharmacia LKB 
Biotechnology, Piscataway NJ). Likewise, monoclonal antibodies are prepared from mouse 
ascites fluid by ammonium sulfate precipitation or chromatography on immobilized Protein 
1 5 A. Partially purified immunoglobulin is covalently attached to a chmmatographic resin such 
as CnBr-activated Sepharose (Pharmacia LKB Biotechnology). The antibody is coupled to 
the resin, the resin is blocked, and the derivative resin is washed according to the 
manufacturer's instructions. 

20 Such immunoaffmity columns are utilized in the purification of HEDG-5 by preparing 

a fraction from cells containing HEDG-5 in a soluble form. This preparation is derived by 
solubilization of whole cells or of a subcellular fraction obtained via differential 
centrifugation (with or without addition of detergent) or by other methods well known in the 
art. Alternatively, soluble HEDG-5 containing a signal sequence is secreted in useful 

25 quantity into the medium in which the cells are grown. 

A soluble HEDG-5-containing preparation is passed over the immunoaffinity column, 
and the column is washed under conditions that allow the preferential absorbance of HEDG-5 
(e.g., high ionic strength buffers in the presence of detergent). Then, the column is eluted 
30 under conditions that disrupt antibody/protein binding (e.g., a buffer of pH 2-3 or a high 
concentration of a chaotrope such as urea or thiocyanate ion), and HEDG-5 is collected. 



SDOCID: <WO 9933972A1_L> 



48 



WO 99/33972 PCT/CA98/01193 
EXAMPLE 12: Drug Screening 

This invention is particularly useful for screening therapeutic compounds by using 
HEDG-5 or binding fragments thereof in any of a variety of drug screening techniques. As 

5 HEDG-5 is a G protein coupled receptor any of the methods commonly used in the art may 
potentially used to identify HEDG-5 ligands. For example, the activity of a G protein 
coupled receptor such as EDG-5 can be measured using any of a variety of appropriate 
functional assays in which activation of the receptor results in an observable change in the 
level of some second messenger system, such as adenylate cyclase, guanylyl cyclase, calcium 

1 0 mobilization, or inositol phospholipid hydrolysis. More particularly, activation of EDG-5 can 
be measured using the NF-kB, SRE and/or AP-1 functional assays, as described above. One 
approach, measures the effect of ligand binding on the activation of intracellular second 
messenger pathways, using a reporter gene. Typically, the reporter gene will have a promoter 
which is sensitive to the level of that second messenger controlling expression of an easily 

1 5 detectable gene product, for example, CAT or luciferase. Alternatively, the cell is loaded 
with a reporter substance, e.g., FURA whereby changes in the intracellular concentration of 
calcium indicate modulation of the receptor as a result of ligand binding. Thus, the present 
invention provides methods of screening for drugs or any other agents which affect signal 
transduction. 

20 

Alternatively, the polypeptide or fragment employed in such a test is either free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells (or membrane 
preparations therefrom) which are stably transformed with recombinant nucleic acids 

25 expressing the polypeptide or fragment. Drugs are screened against such transformed cells in 
competition binding assays. 32 P-labelled LPA could be used in such a competition binding 
assay for HEDG-5. Such cells, either in viable or fixed form, are used for standard binding 
assays. One measures, for example, the formation of complexes between HEDG-5 and the 
agent being tested. Alternatively, one examines the diminution in complex formation 

30 between HEDG-5 and a ligand, for example LPA, caused by the agent being tested. 

EXAMPLE 13: Rational Drug Design 

Herein, the goal of rational drug design is to produce structural analogs of biologically 
35 active phospholipids of interest or of small molecules with which they interact, agonists, 
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antagonists, or inhibitors. Any of these examples are used to fashion drugs which are more 
active or stable forms of the phospholipid or which enhance or interfere with the function of a 
phospholipid in vivo. 

5 In one approach, the three-dimensional structure of a protein of interest, or of a 

protein-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, 
most typically, by a combination of the two approaches. Both the shape and charges of the 
polypeptide must be ascertained to elucidate the structure and to determine active site(s) of 
the molecule. Less often, useful information regarding the structure of a polypeptide is 

10 gained by modeling based on the structure of homologous proteins. In both cases, relevant 
structural information is used to design efficient inhibitors. Useful examples of rational drug 
design includes molecules which have improved activity or stability as shown by Braxton S 
and Wells JA (1992, Biochemistry 3 1 :7796-7801) or which act as inhibitors, agonists, or 
antagonists of native peptides as shown by Athauda SB et al (1993 J Biochem 1 13:742-46), 

1 5 incorporated herein by reference. 

EXAMPLE 14: Use and Administration of Antibodies, Inhibitors, or Antagonists 

Antibodies, inhibitors, or antagonists of HEDG-5 (or other treatments to limit signal 
20 transduction, LST) provide different effects when administered therapeutically. LSTs are 
formulated in a nontoxic, inert, pharmaceutically acceptable aqueous carrier medium 
preferably at a pH of about 5 to 8, more preferably 6 to 8, although pH may vary according to 
the characteristics of the antibody, inhibitor, or antagonist being formulated and the condition 
to be treated. Characteristics of LSTs include solubility of the molecule, half-life and 
25 antigenicity/immunogenicity. These and other characteristics aid in defining an effective 
carrier. 

LSTs are delivered by known routes of administration including but not limited to 
topical creams and gels; transmucosal spray and aerosol; transdermal patch and bandage; 
30 injectable, intravenous and lavage formulations; and orally administered liquids and pills 
particularly formulated to resist stomach acid and enzymes. The particular formulation, exact 
dosage, and route of administration is determined by the attending physician and varies 
according to each specific situation. 
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Such determinations are made by considering multiple variables such as the condition 
to be treated, the LST to be administered, and the pharmacokinetic profile of a particular 
LST. Additional factors which are taken into account include severity of the disease state, 
patient's age, weight, gender and diet, time and frequency of LST administration, possible 
5 combination with other drugs, reaction sensitivities, and tolerance/response to therapy. Long 
acting LST formulations might be administered every 3 to 4 days, every week, or once every 
two weeks depending on half-life and clearance rate of the particular LST. 

Normal dosage amounts vary from 0.1 to 100,000 micrograms, up to a total dose of 
10 about 1 g, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature; see US Patent Nos. 4,657,760; 5,206,344; or 
5,225,212. Those skilled in the art employ different formulations for different LSTs. 
Administration to cells such as nerve cells necessitates delivery in a manner different from 
that to other cells such as vascular endothelial cells. 

15 

It is contemplated that abnormal signal transduction, trauma, or diseases which trigger 
HEDG-5 activity are treatable with LSTs. These conditions or diseases are specifically 
diagnosed by the tests discussed above, and such testing should be performed in suspected 
cases of viral, bacterial or fungal infections: allergic responses; mechanical injury associated 
20 with trauma; hereditary diseases; lymphoma or carcinoma; or other conditions which activate 
the genes of lymphoid or neuronal tissues. 

EXAMPLE IS: Production of Transgenic Animals 

25 

Animal model systems which elucidate the physiological and behavioral roles of the 
HEDG-5 receptor are produced by creating transgenic animals in which the activity of the 
HEDG-5 receptor is either increased or decreased, or the amino acid sequence of the 
expressed HEDG-5 receptor is altered, by a variety of techniques. Examples of these 

30 techniques include, but are not limited to: 1) Insertion of normal or mutant versions of DNA 
encoding a HEDG-5 receptor, by microinjection, electroporation, retroviral transfection or 
other means well known to those skilled in the art, into appropriate fertilized embryos in 
order to produce a transgenic animal or 2) Homologous recombination of mutant or normal, 
human or animal versions of these genes with the native gene locus in transgenic animals to 

35 alter the regulation of expression or the structure of these HEDG-5 receptor sequences. The 
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technique of homologous recombination is well known in the art. It replaces the native gene 
with the inserted gene and so is useful for producing an animal that cannot express native 
HEDG-5 receptors but does express, for example, an inserted mutant HEDG-5 receptor, 
which has replaced the native HEDG-5 receptor in the animal's genome by recombination, 
5 resulting in underexpression of the transporter. Microinjection adds genes to the genome, but 
does not remove them, and so is useful for producing an animal which expresses its own and 
added HEDG-5 receptors, resulting in overexpression of the HEDG-5 receptors. 

One means available for producing a transgenic animal, with a mouse as an example, 
10 is as follows: Female mice are mated, and the resulting fertilized eggs are dissected out of 
their oviducts. The eggs are stored in an appropriate medium such as M2 medium. DNA or 
cDNA encoding a HEDG-5 purified from a vector by methods well known in the art. 
Inducible promoters may be fused with the coding region of the DNA to provide an 
experimental means to regulate expression of the transgene. Alternatively or in addition, 
15 tissue specific regulatory elements may be fused with the coding region to permit tissue- 
specific expression of the trans-gene. The DNA, in an appropriately buffered solution, is put 
into a microinjection needle (which may be made from capillary tubing using a piper puller) 
and the egg to be injected is put in a depression slide. The needle is inserted into the 
pronucleus of the egg, and the DNA solution is injected. The injected egg is then transferred 
20 into the oviduct of a pseudopregnant mouse ( a mouse stimulated by the appropriate 

hormones to maintain pregnancy but which is not actually pregnant), where it proceeds to the 
uterus, implants, and develops to term. As noted above, microinjection is not the only 
methods for inserting DNA into the egg cell, and is used here only for exemplary purposes. 

25 EXAMPLE 16: Isolation, Chromosomal Localization and Partial Sequencing of a hedg- 
5 Genomic Clone 

To identify genomic clones containing the hedg-5 gene, the H501-20F (SEQ ID NO: 
6) and H501-246R (SEQ ID NO: 7) primers were used to amplify human genomic DNA as 
30 described in Example 2. One microliter of human genomic DNA (Clontech; Cat #6550-1) 

was used as template. The PCR product was purified and sequenced in-house, using the PCR 
primers to prime the sequencing reactions. The sequence of this product (see SEQ ID. NO: 
12) matched the cDNA sequence previously obtained for hedg-5 (see SEQ ID. NO: 13), 
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indicating that these primers could be used to identify genomic clones containing this region 
ofthehedg-5 gene. 

An arrayed library of genomic DNA clones (Genome Sciences Inc.) was screened by 
PCR using these primers. The library contained bacterial artificial chromosome (BAC) 

5 constructs with -120 kb human genomic DNA inserts. In total, clones representing about 3 
haploid genome equivalents were screened using the edg-5 diagnostic PCR primers. Two 
clones were identified by this method: BAC-28 (IF) and BAC-236 (13M). Once the DNA 
from these clones was received, their identity was verified in-house by sequencing of the 
PCR product we obtained using the edg-5 diagnostic primers: this analysis showed both 

10 clones represent at least part of the hedg-5 gene. The BAC-28 (IF) clone was subsequently 
used to localize the gene on human chromosomes by fluorescent in situ hybridization (FISH) 
at Genome Systems Inc. The locus for the hedg-5 gene mapped to band P 22.3 of human 
chromosome 1. 

1 5 A search of the on-line Mendelian Inheritance in Man database revealed two entries 

for inherited diseases which genetically map to this region, but for which genes have not yet 
been cloned. These were the database entries 154280 (Malignant Transformation 
Suppression-1 or MTS1) and 157900 (Moebius Syndrome). The first represents a dominant 
suppresser of cellular transformation (a class of genes called tumor suppressers or anti- 

20 oncogenes), while the second is an inherited syndrome in which the sixth and seventh cranial 
nerves are small or absent, leading to facial paralysis. Whether edg-5 gene defects contribute 
to either of these phenotypes is not known. 

Sequencing was performed on DNA prepared from BAC-28 (IF) to determine the 
25 position(s) of introns (if any) within the coding region of the edg-5 gene. Sequencing results 
showed that only one intron exists within the coding region of hedg-5, at a position indicated 
by the arrowhead between nt 996/997 of the sequence shown in Figure 4A. This intron falls 
within the codon for Gly-246 of the edg-5 amino acid sequence. Additional sequencing was 
performed in the region flanking the 5' end of the edg-5 cDNA sequence derived from pC3- 
30 hedg-55, revealing 250 bp of genomic DNA sequence upstream of the 5' end of the cDNA. 
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EXAMPLE 17: Expression and tissue distribution of Edg-5 RNA in the rat. 

Northern blotting was carried out with the edg-5 cDNA insert by techniques well- 
known in the art. Two different multi-tissue rat RNA blots (Origene .Cat. MB- 1005 and MB- 
5 1007) were probed with radiolabeled edg-5 cDNA. Washing was performed at high 

stringency conditions that do not permit detection of edg-2 or other related transcripts. The 
blots were then subjected to autoradiography. The Northern blot results show that RNA 
expression levels are highest in lung, kidney and testis. Lower RNA levels were seen in skin, 
heart, small intestine and stomach. Little or no detectable RNA was found in thymus, brain, 
10 spleen and liver. Muscle tissue may also express low levels of edg-5 mRNA. Further, anti- 
sense oligonucletide probes based on the hedg-5 sequence disclosed herein can be used by 
those of skill in the art to for in situ hybridization expression studies. 

Various modifications and variations of the described method and system of the 
15 invention will be apparent to those skilled in the art without departing from the scope and 

spirit of the invention. Although the invention has been described in connection with specific 
preferred embodiments, it should be understood that the invention as claimed should not be 
unduly limited to such specific embodiments. 

20 
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CLAIMS 

1 An isolated nucleotide sequence encoding a mammalian EDG-5 receptor or 
biologically active fragment thereof 

5 

2. The isolated nucleotide sequence of claim 1 encoding a murine EDG-5 receptor or 
biologically active fragment thereof. 

3. The isolated* nucleotide sequence of claim 2 encoding a murine EDG-5 receptor of 
10 Figure IB or biologically active fragment thereof. 

4. The isolated nucleotide sequence of claim 1 encoding a human EDG-5 receptor or 
biologically active fragment thereof. 

15 5. The isolated nucleotide sequence of claim 1 wherein the biologically active fragment 
is activated by LP A. 

6. An isolated nucleotide sequence selected from the group consisting of: 

(a) the nucleotide sequence comprising nucleotides 36-1907 of SEQ. ID NO: 12 
20 (b) the nucleotide sequence of Figure 3B; 

(c) the nucleotide sequence of Figure 3C; 

(d) a nucleotide sequence comprising at least about 70% sequence identity to (a), (b) 
or (c) and which hybridizes under stringent conditions to the nucleotide sequence of (a), (b) 
or (c) ? respectively; and 

25 (e) the nucleotide sequence which encodes the amino acid sequence of Figure 4 A, 4B 

or4C. 

7. The isolated nucleotide sequence of Claim 6 wherein the nucleotide sequence is 
selected from the group consisting of: 

30 (1) the nucleotide sequence of (a), (b), (c) or (e) of claim 6; and 

(2) the nucleotide sequence of (d) of claim 6 wherein the nucleotide sequence has at 
least about 80-85% sequence identity to the nucleotide sequence of (a), (b) or (c) of claim 6. 
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8. The isolated nucleotide sequence of Claim 6 wherein the nucleotide sequence is 
selected the group consisting of: 

(1) the nucleotide sequence of (a), (b), (c) or (e) of claim 6; and 

(2) the nucleotide sequence of (d) of claim 6 wherein the nucleotide sequence has at 
least about 95% sequence identity to the nucleotide sequence of (a), (b) or (c) of claim 
6. 

9. The complement of the nucleotide sequence of Claim 8. 

10. An expression vector comprising the nucleotide sequence of Claim 8. 

11. A host cell comprising the expression vector of Claim 1 0. 

12. The isolated and purified amino acid sequence for the HEDG-5 receptor encoded by 
the nucleotide sequence of claim 8. 

13. The isolated and purified amino acid sequence of claim 12 comprising the amino acid 
sequence of SEQ. ID NO: 13 (Figure 4A), Figure 4B or Figure 4C or a biological acitve 
portion thereof. 

14. The isolated nucleotide sequence of Claim 6 wherein the nucleotide sequence is 
selected from the group consisting of the nucleotide sequence which encodes the amino acid 
sequence of SEQ ID NO: 13 (Figure 4A), Figure 4B and Figure 4C. 

15. A hybridization probe of the nucleotide sequence of Claim 5 . 

16. A method of screening compounds to identify HEDG-5 ligands comprising the steps 
of: 

(a) culturing cells which express the HEDG-5 receptor or with a membrane 
preparation obtained therefrom; and 

(b) contacting said compound with said cells or said membrane preparation; and 

(c) determining whether binding between the HEDG-5 receptor and the candidate 
ligand has occurred. 



56 

SUBSTITUTE SHEET (Rule 26) 



WO 99/33972 

17. A HEDG-5 ligand identified by the method of claim 16. 
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18. A method of screening compounds to identify HEDG-5 antagonists comprising the 
steps of: 

5 (a) culturing cells which express the HEDG-5 receptor or with a membrane 

preparation obtained therefrom; 

(b) contacting said cells or said membrane preparation with a mixture comprising 
an agonist and said compound to be tested for antagonist activity at said receptor; and 
(3) determining the degree of antagonist activity by measuring a response 
1 0 indicative of the degree of binding between said agonist and said receptor and 

comparing this measured response with a standard response for binding between said 
agonist and said receptor absent the antagonist. 

19. The method of claim 1 8 wherein said agonist is LP A. 

15 

20. An antagonist identified by the method of claim 18. 
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The present invention is directed to nucleic acid sequences and amino acid sequences 
for mammalian EDG-5 receptor homologs, and particularly for human EDG-5 receptor 
5 homologs. The invention also provides methods for determining agonists and antagonists for 
EDG-5 receptors in addition to assays, expression vectors, host cells and methods for treating 
disorders associated with aberrant expression or activity of EDG-5. SIP and SPC are 
agonists for EDG-5 receptors. 



10 



vJSDOCID: <WO 9933972A1_I_> 



58 

SUBSTITUTE SHEET (Rule 26) 



WO 99/33972 PCT/CA98/01193 

Figure 1A: DNA sequence of the murine edg-5 RT-PCR clone 501 
(SEQ ID NO: 3) 



1 


AACACTGGCC 


CGGTGTCGAA 


AACGTTGACC 


GTCAACCGCT 


GGTTCCTCCG 


51 


CCAGGGGCTC 


CTAGACACCA 


GCCTGACTGC 


CTCCCTGGCC 


AATTTGCTGG 


101 


TTATTGCTGT 


GGAAAGACAC 


ATGTCNATCA 


TGAGGATGAG 


AGTCCACAGC 


151 


AACTTGACCA 


AAAAGCGGGT 


GACGCTGCTC 


ATTCTGCTGG 


TGTGGGCCAT 


201 


CGCCATCTTC 


ATGGGGG CCG 


TCCCCACNCT 


GGGATGGAAT 


TGCCTCTGCA 


251 


ACATCTCGGC 


CTGCTCTTCT 


CTGGCTCCCA 


TTTACAGTAG 


GAGTTACCTC 


301 


ATTTTCTGGA 


CTGTGTCCAA 


CCTCCTGGCC 


TTCTTCATCA 


TGGTGGCGGT 


351 


ATACGTACGC 


ATCTACATGT 


ATGTTAAAAG 


GAAAACCAAC 


GTCTTATCTC 


401 


CACACACCAG 


TGGCTCCATC 


AGCCGCCGGA 


GGGCTCCCAT 


GAAGCTAATG 


451 


AAGACAGTGA 


TGACCGTCTT 


AGGCGCCTTC 


GTGGTGTGCT 


GGACCCCGGG 


501 


TCTGGTGGTT 


CTGCTGCTGG 


ACGGCCTGAA 


CTGCAAGCAG 


TGTAACGTGC 


551 


AACACGTGAA 


GNGCTGGTTC 


CTGCTGCTCG 


CACTGCTCAA 


CTCCGTCATG 


601 


AACCCCCTCA 


TCTACTGCCG 


CTCTCCNNAC 


TTTCCATGG 





Figure IB: Sequence of full-length xnEDG-5 cDNA insert and 
alignment with MEDG-5 amino acid sequence. Translation starts 
at nt 19. Translation termination codon is located at nt 
1081-1083 

MNECHYDKRMDFFY 
GCACAGTTCTTGTCCACCATGAATGAGTGTCACTATGACAAGCGCATGGACTTTTTCTAC 
1 + + + + + + 60 

NRSNTDTADEWTGTKLV I V L 
AACAGGAGCAACACAGACACAGCGGACGAGTGGACAGGGACAAAGCTTGTGATCGTCCTG 



61 + + ~ + + + 120 

CVGTFFCLFIFFSNSLV IAA 
TGCGTGGGGACGTTCTTCTGCCTCTTTATATTTTTTTCTAACTCCCTGGTCATTGCTGCG 
121 -- + + + ---+ +— + 180 

VITNRKFHFPFYYLLANLAA 
GTGATCACAAACCGGAAGTTCCACTTTCCCTTCTACTACCTGCTGGCTAACTTAGCTGCT 
181 - + + + + — + + 240 

ADFFAG IAYVFLMFNTG PVS 
GCGGATTTCTTCGCCGGAATCGCTTACGTGTTCCTGATGTTTAACACTGGCCCGGTGTCG 
24i + + + + + + 300 

KTLTVNRWFLRQGLLDTSLT 
AAAACGTTGACCGTCAACCGCTGGTTCCTCCGCCAGGGGCTCCTAGACACCAGCCTGACT 
301 + + + + + 360 
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Figure IB (cont.) 

ASLANLLVIAVERHMS IMRM 
GCCTCCCTGGCCAATTTGCTGGTTATTGCTGTGGAAAGACACATGTCAATCATGAGGATG 
361 + + + -+ + + 420 

RVHSNLTKKRVTLLILLVWA 
AGAGTCCACAGCAACTTGACCAAAAAGCGGGTGACGCTGCTCATTCTGCTGGTGTGGGCC 
421 + + + + + + 480 

IAIFMGAVPTLGWNCL CNIS 
ATCGCCATCTTCATGGGGGCCGTCCCCACGCTGGGATGGAATTGCCTCTGCAACATCTCG 
481 + + + + + + 540 

ACSSLAPIYSRSYL IFWTVS 

GCCTGCTCTTCTCTGGCTCCCATTTACAGTAGGAGTTACCTCATTTTCTGGACTGTGTCC 
541 + + + + + + 600 

NLLAFFI MVAVYVRIYMYVK 
AACCTCCTGGCCTTCTTCATCATGGTGGCGGTATACGTACGCATCTACATGTATGTTAAA 
601 + + + + + -- + 660 

RKTNVLS PHTSGSISRRRAP 
AGGAAAACCAACGTCTTATCTCCACACACCAGTGGCTCCATCAGCCGCCGGAGGGCTCCC 
661 - + + - + -+ + + 720 

MKLMKTVMTVLGAFVVCWTP 
ATGAAGCTAATGAAGACAGTGATGACCGTCTTAGGCGCCTTCGTGGTGTGCTGGACCCCG 
721 - + ---+ + + + + 780 

GLVVLLLDGLNCKQCNVQHV 
GGTCTGGTGGTTCTGCTGCTGGACGGCCTGAACTGCAAGCAGTGTAACGTGCAACACGTG 
781 + + + + + --- + 840 

KRWFLLLALLNSVMNPI IYS 
AAGCGCTGGTTCCTGCTGCTCGCACTGCTCAACTCCGTCATGAACCCCATCATCTACTCG 
841 + + + + + + 900 

Y KDEDMYNTMRKM ICCALQD 
TACAAGGACGAGGACATGTACAACACCATGCGGAAGATGATCTGCTGTGCCCTGCAGGAC 
901 + + + + + + 96O 

SNTERRPSRNPSTIHSRSET 
AGCAATACCGAGAGGCGCCCCTCCCGCAACCCCTCCACCATCCACAG CAGGAG CGAGACG 
961 + + + + - -+ 1020 

GSQYLEDS ISQGPVCNKNGS 
GGCAGCCAGTACCTGGAGGACAGCATCAGCCAGGGCCCGGTGTGCAATAAAAACGGCTCC 
1021 + + + + + ---+ 1080 

* 

TAAGCCACGGACGCCTCCGCCCTCTTCCCCTGGGGAAAGAGCTGTTAAGCGTCCTCACCT 
1081 + + -+ + + + H40 

GTCTCACAAAGCACGTGGACAGGGTTGTTTGAGGGCTCCATGCATCACTTCTGGGGCTTT 
1141 - + + - + + -+ + 1200 
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Figure IB (cont.) 

TAAGTTTTCATGGTCAAGGAAAATAGATTTACGGCGTTTAGTAAAGCGCACAGGAAAGGG 
1201 + + + + + + 1260 

AGAGATGAGCAGTGGGTTCCGGCTTGTCTGTGATCCGCTCCCAACATCCTCCAGCTCTTG 
1261 + + + + + + 1320 

CGAGAGCATGCTGGGCTCTGTCACCATCTTGCCACCATTGTCTGTGTGTTTTCAATGATG 
1321 + + + + + -- + 1380 

GTGTTGAAAGTCCTAGGTCAAAAGAAAGTAGTAAATAATGGTACCTGAGCCCCCCATTGT 
1381 + + + + + + 1440 

GTGGCTACTAGATTCTGTAGTTGTTTCCGCATGGGTTTAAAATGTTCAGAAAAATATTTT 
1441 + + + + + + 1500 

AG CAGTGAACTTTGATTTCCTCAGAGAAGCCATGGC CAGGAG CTAGGTGGG CAACTGTAT 
1501 + + + + - + + 1560 

AGTAGAGTAAGTGATGATATTGACCGGTAGGTTGAACTTCTTCCAAATAGCGTCAAATAT 
1561 + + + + + + 1620 

GAGCACGATTAGATCTTCAGTCTTGGTTATCAGGATACCGCTGAGGGGCTTGCTGGATCC 
1621 + + + + + + 1680 

CAAGTGCAAAGTAATTGCACATCGAGTATTTTAACCAAAGCTGCCAGCGTATTCTATCTT 
1681 - + + -t + + + 1740 

GTGGACTGCATTTTGATCTTGTATTTTTCTCCTTCAAAGACCTCTGAAAGGTAGATCAGT 
1741 + + + + -+ + 1800 

TAAAAACAAAAATAGTGTTCATACACATAGGCTACTGACCAGTGTTTTCGGTGTAAGACG 
1801 + + + -- -+ -+ + I860 

TTTAGAGTGTATCTGACAAAGTAAGT^ATAACTTCAAGGCAGGCACTATGGTATTTATGTA 
1861 + + + + + + 1920 

GCTTGCAAACGTTTACATGTTCTCTCTCTCTCTCTCTCTCCCCTCTGCTGTTGTGATGTA 
1921 -- + + + + + + 1980 

ACATTTATGTGCACAAACTACTTGTAATAAAATATTTTAAGAAGCAAAAAAAA 
1981 + + +- -- + -- + 2033 



Figure 2: Predicted amino acid sequence of Mouse partial ED6-5 
cDNA. X represents an amino acid which cannot be assigned due 
to poor sequencing information from direct PCR sequencing 

1 NTGPVSKTLT VNRWFLRQGL LDTSLTASLA NLLVIAVERH MSIMRMRVHS 

51 NLTKKRVTLL ILLVWAIAIF MGAVPTLGWN CLCNISACSS LAPIYSRSYL 

101 IFWTVSNLLA FFIMVAVYVR IYMYVKRKTN VLSPHTSGSI SRRRAPMKLM 

151 KTVMTVLGAF WCWTPGLW LLLDGLNCKQ CNVQHVKXWF LLLALLNSVM 

201 NPLIYCRSPX FPW 
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Figure 3A: Nucleotide sequence of a hEDG-5 cDNA inserted into 
pcDNA3 (SEQ ID NO: 13) 

Start and stop codons underlined. 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 



gaattcgcgg 
TATGACAAGC 
CGATGACTGG 
TTTTCTGCCT 
ATCAAAAACA 
AGCTGCTGCC 
ACACAGGCCC 
CAGGGGCTTC 
TATCGCCGTG 
ACCTGACCAA 
GCCATTTTTA 
CATCTCTGCC 
TTTTCTGGAC 
TACCTGCGGA 
GCATACAAGT 
AGACGGTGAT 
CTGGTGGTTC 
GCATGTGAAA 
ACCCCATCAT 
AAGATGATCT 
CATCCCCTCC 
AGGATAGTAT 
CTGGATGCCT 
GAATGATTAC 
TCCATTAATC 
AAGCATGGGC 
cgacgcggcc 
catgcat 



ccgcgtcgac 
ACATGGACTT 
ACAGGAACAA 
GTTTATTTTT 
GAAAATTTCA 
GATTTCTTCG 
AGTTTCAAAA 
TGGACAGTAG 
GAGAGGCACA 
AAAGAGGGTG 
TGGGGGCGGT 
TGCTCTTCCC 
AGTGTCCAAC 
TCTACGTGTA 
GGGTCCATCA 
GACTGTCTTA 
TGCCCCTCGA 
AGGTGGTTCC 
CTACTCCTAC 
GCTGCTTCTC 
ACAGTCCTCA 
TAGCCAAGGT 
CTYGGCCCAC 
CTGTCTCTAA 
ACTGCTAGAT 
AGTAAAGAGA 
gcgaattctt 



gttcaCTTCT 
TTTTTATAAT 
AGCTTGTGAT 
TTTTCTAATT 
TTTCCCCTTT 
CTGGAATTGC 
ACTTTGACTG 
CTTGACTGCT 
TGTCAATCAT 
ACACTGCTCA 
CCCCACACTG 
TGGCCCCCAT 
CTCATGGCCT 
CGTCAAGAGG 
GCCGCCGGAG 
GGGGCGTTTG 
CGGCCTGAAC 
TGCTGCTGGC 
AAGGACGAGG 
TCAGGAGAAC 
GCAGGAGTGA 
GCAGTCTGCA 
CCAGGCCTCC 
CAAAGCCCAT 
TTCTTTAAAA 
GGACCTGCTG 
ttgcttttta 



CCACAATGAA 
AGGAGCAACA 
TGTTTTGTGT 
CTCTGGTCAT 
TACTACCTGT 
CTATGTATTC 
TCAACCGCTG 
TCCCTCACCA 
GAGGATGCGG 
TTTTGCTTGT 
GGCTGGAATT 
TTACAGCAGG 
TCCTCATCAT 
AAAACCAACG 
GACACCCATG 
TGGTATGCTG 
TGCAGGCAGT 
GCTGCTCAAC 
ACATGTATGG 
CCAGAGAGGC 
CACAGGCAGC 
ATAAAAGCAC 
TCTGGGAAAA 
GTACAGTGTT 
AATTTTTTTT 
CATTTAGAGA 
ccctggaaga 



TGAGTGTCAC 
CTGATACTGT 
GTTGGGACGT 
CGCGGCAGTG 
TGGCTAATTT 
CTGATGTTTA 
GTTTCTCCGT 
ACTTGCTGGT 
GTCCATAGCA 
CTGGGCCATC 
GCCTCTGCAA 
AGTTACCTTG 
GGTTGTGGTG 
TCTTGTCTCC 
AAGCTAATGA 
GACCCCGGGC 
GTGGCGTGCA 
TCCGTCGTGA 
CACCATGAAG 
GTCCCTCTCG 
CAGTACATAG 
TTCCTAAACT 
GAGCTGTTAA 
ATTTGAGGTC 
CATAGTTTAA 
AAGCACAGgt 
aatactcgag 
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Figure 3B: cDNA sequence of clone pC3-hEDG5#3 .4 from the 
region encoding a hEDG5 polypeptide 

1 ATGAATGAGT GTCACTATGA CAAGCACATG GACTTTTTTT ATAATAGGAG 

51 CAACACTGAT ACTGTCGATG ACTGGACTGG AACAAAGCTT GTGATTGTTT 

101 TGTGTGTTGG GACGTTTTTC TGCCTGTTTA TTTTTTTTTC TAATTCTCTG 

151 GTCATCGCGG CAGTGATCAA AAACAGAAAA TTTCATTTCC CCTTCTACTA 

201 CCTGTTGGCT AATTTGGCTG CTGCCGATTT CTTCGCTGGA ATTGCCTATG 

251 TATTCCTGAT GTTTAACACA GGCCCAGTTT CAAAAACTTT GACTGTCAAC 

301 CGCTGGTTTC TCCGTCAGGG GCTTCTGGAC AGTAGCTTGA CTGCTTCCCT 

351 CACCAACTTG CTGGTTATCG CCGTGGAGAG GCACATGTCA ATCATGAgGA 

4 01 TGCGGGTCCA TAGCAACCTG ACCAAAAAGA GGGTGACACT GCTCATTTTG 

451 CTTGTCTGGG CCATCGCCAT TTTTATGGGG GCGGTCCCCA CACTGGGCTG 

501 GAATTGCCTC TGCAAcATCT CTGCCTGcTC TTCCCTGGCC CCCATTTACA 

551 GCAGGAGTTA CCTTGTTTTc TGGACAGTGT CCAACCTCAT GGCCTTCCTc 

601 ATCATGGTTG TGGTGTACCT GCGGATCTAC GTGTACGTCA AGAGGAAAAC 

651 CAACGTCTTG TCTCCGCATA CAAGTGGGTC CATCAGCCGC CGGAGGACAC 

701 CCATGAAGCT AATGAAGACG GTAATGACTG TCTTAGGGGC GTTTGTGGTA 

751 TGCTGGACCC CGGGCCTGGT GGTTCTGCTC CTCGACGGCC TGAACTGCAG 

801 GCAGTGTGGC GTGCAGCATG TGAAAAGGTG GTTCCTGCTG CTGGCGCTGC 

851 TCAACTCCGT CGTGAACCCT ATCATCTACT CCTACAAGGA CGAGGACATG 

901 TATGGCACCA TGAAGAAGAT GATCTGCTGC TTCTCTCAGG AGAACCCAGA 

951 GAGGCGTCCC TCTCGCATCC CCTCCACAGT CCTCAGCAGG AGTGACACAG 

1001 GCAGCCAGTA CATAGAGGAT AGTATTAGCC AAGGTGCAGT CTGCAATAAA 

1051 AGCACTTCCT AA 
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Figure 3C: cDNA sequence of clone pC3-hEDG5#28 from the region 
encoding a hEDG5 polypeptide. 

1 ATGAATGAGT GTCACTATGA CAAGCACATG GAC T TTTTTT ATAATAGGGG 

51 CAACACTGAT ACTGTCGATG ACTGGACAGG AACAAAGCTT GTGATTGTTT 

101 TGTGTGTtGG GACGTTTTTC TGCCTGTTTA TTTTTTTTTC TAATTCTCTG 

151 GTCATCGCGG CAGTGATCAA AAACAGAAAA TTTCATTTCC CCTTCTACTA 

201 CCTGTTGGCT AATTTAGCTG CTGCCGATTT CTTCGCTGGA ATTGCCTATG 

251 TATTCCTGAT GTTTAACACA GGCCCAGTTT CAAAAACTTT GACTGTCAAC 

301 CGCTGGTTTC TCCGTCAGGG GCTTCTGGAC AGTAGCTTGA CTGCTTCCCT 

351 CACCAACTTG CTGGTTATCG CCGTGGAGAG GCACATGTCA ATCATGAGGA 

401 TGCGGGTCCA TAGCAACCTG ACCAAAAAGA GGGTGACACT GCTCATTTTG 

451 CTTGTCTGGG CCATCGCCAT TTTTATGGGG GCGGTCCCCA CACTGGGCtG 

501 GAATTGCCTC TGCAAcATcT CTGCCTGcTC TTCCCTGGCC CCCATTTACA 

551 GCAGGAGTTA CCTTGTTTTC TGGACAGTGT CcAACCTCAT GGCcTTCCTC 

601 ATCATGGTTG TGGTGTACCT GCGGATCtAC GTGtACGTCA AGAGGAAAAC 

651 CAAcGTCTTG TCTCCGCAtA CAAGTGGGTC CATCAGCCGC CGGAGGACAC 

701 CCATGAAGCT AATGAAGACG GTGATGACTG TCTTAGGGGC GTTTGTGGTA 

751 TGCTGGACCC CGGGCCTGGT GGTTCTGCTC CTCGACGGCC TGAACTGCAG 

801 GCAGTGTGGC GTGCAGCATG TGAAAAGGTG GTTCCTGCTG CTGGCGCTGC 

851 TCAACTCCGT CGTGAACCCC ATCATCTACT CCTACAAGGA CGAGGACATG 

901 TATGGCACCA TGAAGAAgAT GATCTGCTGC TTCTCTCAGG AGAACCCAGA 

951 GAGGCGTCCC TCTCGCATCC CCTCCACAGT CCTCAGCAGG AGTGACACAG 

1001 GCAGCCAGTA CATAGAGGAT AGTATTAGCC AAGGTGCAGT CTGCAATAAA 

1051 AGCACTTCCT AA 
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Figure 4A: Aligned hedg-5 cDNA and predicted amino acid 
sequence. The first 250 bp of DNA sequence (lower case) is 
derived from genomic DNA flanking the 5' end of the cDNA 
insert from clone pC3-hedg-55. Sequences from nt 251-1523 are 
shown in lower case wherever apparent polymorphisms in 
different human clones were found. Coding region polymorphisms 
are detailed in Table 1. One intron exists within the coding 
region of hedg-5, located between nt 996/997 of the cDNA 
sequence shown. 



caccttcctaacctgagcggcctagcctgggaaacaaacaattaaaatgtgcgctaaatg 
I + + + -+ + + 60 

gtggaaggattggactcgccggatcggaccctttgtttgttaattttacacgcgatittac 

ctgtggtaggaggtcaggggctatgtcctggaccaaaggacatttgcactgagacctgac 

61 --+ + + + + + 120 

gacaccatcctccagtccccgatacaggacctggtttcctgtaaacgtgactctggactg 

acttcaggtcttcaactcccttgatgggagttagccagaacgggcttagaaacagcaatt 

121 + + + -f + --- + 180 

tgaagtccagaagttgagggaactaccctcaatcggtcttgcccgaatctttgtcgrcaa 

gatggcttagtgactgattttacaaatgatatttgtttcttctttaaatttctttccagg 

181 +- -+ + + + 240 

ctaccgaatcactgactaaaatgtttactataaacaaagaagaaatttaaagaaagatcc 

MNECHYDKHMDFFY 
atgttcactt CTTCTCC^CAATGAATGAGTGTCACTATGACAAGCACATGGACTTTTTTT 

241 + + + + + 300 

tacaagtgaagaagAGGTGTTACTTACTCACAGTGATACTGTTCGTGTACCTGAAAAAAA 

NRSNTDTVDDWTGTKLVIVL 
ATAATAGGAGCAACACTGATACTGTCGATGACTGGACAGGAACAAAGCTTGTGATTGTTT 
301 + + + + + 360 

TATTATCCTCGTTGTGACTATGACAGCTACTGACCTGTCCTTGTTTCGAACACTAACAAA 

CVGTFFCLFI FFSNSLVIAA 
TGTGTGTTGGGACGTTTTTCTGCCTGTTTATTTTTTTTTCTAATTCTCTGGTCATCGCGG 

361 + + -+ +- - + + 420 

ACACACAACCCTGCAAAAAGACGGACAAATAAAAAAAAAGATTAAGAGACCAGTAGCGCC 

VIKNRKFHFPFYYLLANLAA 
CAGTGATCAAAAACAGAAAATTTCATTTCCCCTTTTACTACCTGTTGGCTAATTTAGCTG 

421 + + + + -+ -+ 480 

GTCACTAGTTTTTGTCTTTTAAAGTAAAGGGGAAAATGATGGACAACCGATTAAATCGAC 

ADFFAG I AYVFLMFNTGPVS 
CTGCCGATTTCTTCGCTGGAATTGCCTATGTATTCCTGATGTTTAACACAGGCCCAGTTT 

481 + + --- + + + + 540 

GACGGCTAAAGAAGCGACCTTAACGGATACATAAGGACTACAAATTGTGTCCGGGTCAAA 

KTLTVNRWFLRQGLLD SSLT 
CAAAAACTTTGACTGTCAACCGCTGGTTTCTCCGTCAGGGGCTTCTGGACAGTAGCTTGA 

541 - + + + + + 600 

GTTTTTGAAACTGACAGTTGGCGACCAAAGAGGCAGTCCCCGAAGACCTGTCATCGAACT 
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Figure 4A (cont.) 

ASLTNLLVIAVERHMS IMRM 
CTGCTTCCCTCACCAACTTGCTGGTTATCGCCGTGGAGAGGCACATGTCAATCATGAGGA 

601 + + + --- + + + 660 

GACGAAGGGAGTGGTTGAACGACCAATAGCGGCACCTCTCCGTGTACAGTTAGTACTCCT 

RVHSNLTKKRVTLL ILLVWA 

661 + + + + + + 720 

ACGCCCAGGTATCGTTGGACTGGTTTTTCTCCCACTGTGACGAGTAAAACGAACAGACCC 

IAIFMGAVPTLGWNCLCNIS 
CCATCGCCATTTTTATGGGGGCGGTCCCCACACTGGGCTGGAATTGCCTCTGCAACATCT 

721 + + + + + + 780 

GGTAGCGGTAAAAATACCCCCGCCAGGGGTGTGACCCGACCTTAACGGAGACGTTGTAGA 

ACSSLAPIYSRSYLVFWTVS 
CTGCCTGCTCTTCCCTGGCCCCCATTTACAGCAGGAGTTACCTTGTTTTCTGGACAGTGT 

781 + + + + + + 840 

GACGGACGAGAAGGGACCGGGGGTAAATGTCGTCCTCAATGGAACAAAAGACCTGTCACA 

NLMAF L I MVVVYLR I YVYVK 
CCAACCTCATGGCCTTCCTCATCATGGTTGTGGTGTACCTGCGGATCTACGTGTACGTCA 

841 + + + + + + 900 

GGTTGGAGTACCGGAAGGAGTAGTACCAACACCACATGGACGCCTAGATGCACATGCAGT 

RKTNVLSPHTSGS ISRRRTP 
AGAGGAAAACCAACGTCTTGTCTCCGCATACAAGTGGGTCCATCAGCCGCCGGAGGACAC 

901 +- + ---+ +-.- - + + 960 

TCTCCTTTTGGTTGCAGAACAGAGGCGTATGTTCACCCAGGTAGTCGGCGGCCTCCTGTG 

MKLMKTVMTVLGAFVVCWTP 
CCATGAAGCTAATGAAGACGGTGATGACTGTCTTAGGGGCGTTTGTGGTATGCTGGACCC 

961 + + + + + + 1020 

GGTACTTCGATTACTTCTGCCACTACTGACAGAATCCCCGCAAACACCATACGACCTGGG 

GL VVL PLDGLNCRQCGVQHV 
CGGGCCTGGTGGTTCTGCCCCTCGACGGCCTGAACTGCAGGCAGTGTGGCGTGCAGCATG 

1021 + ~ -+ +--- + + " "--+ 1080 

GCCCGGACCACCAAGACGGGGAGCTGCCGGACTTGACGTCCGTCACACCGCACGTCGTAC 

KRWFLLLALLNSVVNPI IYS 
TGAAAAGGTGGTTCCTGCTGCTGGCGCTGCTCAACTCCGTCGTGAACCCCATCATCTACT 

1081 -- + +— + +" -+ + 1140 

ACTTTTCCACCAAGGACGACGACCGCGACGAGTTGAGGCAGCACTTGGGGTAGTAGATGA 

YKDEDMYGTMKKMI CCFSQE 
CCTACAAGGACGAGGACATGTATGGCACCATGAAGAAGATGATCTGCTGCTTCTCTCAGG 

1141 — + + - -+ + 1200 

GGATGTTCCTGCTCCTGTACATACCGTGGTACTTCTTCTACTAGACGACGAAGAGAGTCC 

NPERR PSRIPSTVLSRSDTG 
AGAACCCAGAGAGGCGTCCCTCTCGCATCCCCTCCACAGTCCTCAGCAGGAGTGACACAG 

12 oi + + --- + - -+ -- + + 1260 

TCTTGGGTCTCTCCGCAGGGAGAGCGTAGGGGAGGTGTCAGGAGTCGTCCTCACTGTGTC 

SQYIEDS ISQGAVCNKSTS* 
GCAGCCAGTACATAGAGGATAGTATTAGCCAAGGTGCAGTCTGCAATAAAAGCACTTCCT 

1261 + + — + + + 1320 

CGTCGGTCATGTATCTCCTATCATAATCGGTTCCACGTCAGACGTTATTTTCGTGAAGGA 
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AAACTCTGGATGCCTCTYGGCCCACCCAGGCCTCCTCTGGGAAAAGAGCTGTTAAGAATG 

1321 + + + + + + 1380 

TTTGAGACCTACGGAGARCCGGGTGGGTCCGGAGGAGACCCTTTTCTCGACAATTCTTAC 

ATTACCTGTCTCTAACAAAGCCCATGTACAGTGTTATTTGAGGTCTCCATTAATCACTGC 

1381 + -+ + + + + 1440 

TAATGGACAGAGATTGTTTCGGGTACATGTCACAATAAACTCCAGAGGTAATTAGTGACG 

TAGATTTCTTTAAAAAATTTTTTTTCATAGTTTAAAAGCATGGGCAGTAAAGAGAGGACC 

1441 + + + + + + 1500 

ATCTAAAGAAATTTTTTAAAAAAAAGTATCAAATTTTCGTACCCGTCATTTCTCTCCTGG 

TGCTGCATTTAGAGAAAG CACAG 

1501 - + 1523 

ACGACGTAAATCTCTTTCGTGTC 



Figure 4B: Predicted amino acid sequence of hEDG5 encoded by 
clone pC3-hEDG5#3.4 

1 MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL 

51 VIAAVIKNRK FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN 

101 RWFLRQGLLD SSLTASLTNL LVIAVERHMS IMRMRVHSNL TKKRVTLLIL 
151 LVWAIAIFMG AVPTLGWNCL CNISACSSLA PIYSRSYLVF WTVSNLMAFL 

201 IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT VMTVLGAFW 

251 CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 

301 YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK 
351 STS 

Figure 4C: Predicted amino acid sequence of h£DG5 encoded by 
clone pC3-hEDG5#28. 

1 MNECHYDKHM DFFYNRGNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL 

51 VIAAVIKNRK FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN 

101 RWFLRQGLLD SSLTASLTNL LVIAVERHMS IMRMRVHSNL TKKRVTLLIL 

151 LVWAIAIFMG AVPTLGWNCL CNISACSSLA PIYSRSYLVF WTVSNLMAFL 

201 IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT VMTVLGAFW 

251 CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 

3 01 YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK 
351 STS 
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Figure 5A: Alignment of predicted amino acid sequences of 
HEDG5 translation products of clones pC3-hEdg5#3 .4, pC3-hEdg5« 
3 and pC3-hEdg5#28 . Amino acid substitutions are indicated in 
reverse bold text. 



he5#3.4_981216 
hsedgS 
he5#28 981215 



he5#3.4_981216 
hsedgS 
he5#28 981215 



he5#3.4_981216 
hsedgS 
he5#28 981215 



he5#3.4_981216 
hsedgS 
he5#28 981215 



he5#3.4_981216 
hsedg5 
he5#28 981215 



he5#3.4_981216 
hsedgS 
he5#2B 981215 



he5#3.4_981216 
hsedgS 
he5#28 981215 



1 . 50 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL 
MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL 
MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL 

51 100 
VIAAVIKNRK FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN 
VIAAVIKNRK FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN 
VIAAVIKNRK FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN 

101 150 

RWFLRQGLLD SSLTASLTNL LVIAVERHMS IMRMRVHSNL TKKRVTLLIL 

RWFLRQGLLD SSLTASLTNL LVIAVERHMS IMRMRVHSNL TKKRVTLLIL 

RWFLRQGLLD SSLTASLTNL LVIAVERHMS IMRMRVHSNL TKKRVTLLIL 

151 200 

LVWAIAIFMG AVPTLGWNCL CNISACSSLA PIYSRSYLVF WTVSNLMAFL 

LVWAIAIFMG AVPTLGWNCL CNISACSSLA PIYSRSYLVF WTVSNLMAFL 

LVWAIAIFMG AVPTLGWNCL CNISACSSLA PIYSRSYLVF WTVSNLMAFL 

201 250 
IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT VMTVLGAFW 
IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT VMTVLGAFW 
IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT VMTVLGAFW 

251 300 

CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 

CWTPGLWLJ3 LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 

CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP IIYSYKDEDM 

301 350 

YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK 

YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK 

YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK 



he5#3. 4_981216 
hsedgS 
he5#28 981215 



351 
STS- 
STS- 
STS- 
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Figure 5B: Alignment of the amino acid sequence of the murine 
edg-5 with the amino acid sequence of the human edg-5 of the 
pC3-hEdg5-3 .4 clone. 



SCORES Initl: 1981 Initn: 2155 Opt: 2170 Z-SCOre: 384.5 E(): 2e-16 

Smith-Waterman score: 2170; 91.2% identity in 354 aa overlap 

10 20 30 40 50 60 

MNECHYDKRMDFFYNRSNTDTADEWTGTKLVIVLCVGTFFCLFIFFSNSLVIAAVITNRK 

I M M II I M M M M M I II M M I II 1 1 1 1 1 IM Ml MM IIIIIIIIIIMI Ml 

MNECHYDKHMDFFYNRSNTDTTO^ 

10 20 30 40 50 60 



MEDG5 
HEDG5-3.4 



70 80 90 100 110 120 

MEDG5 FHFPFYYLIANLAAADFFAGIAYVFLMFNTGPVSKTLTVTJRWF 

I M MM 1 IMMIMMIII M M Ml M Ml Ml Ml I II 1 1 M MMM II Ml Ml 

HEDG5 -3.4 FHFPFYYLLANLAAADFFAGIAYVFLMFNTGPVSKTLTVNRWFLRQGLLDSSLTASLTNL 

70 80 90 100 110 120 

130 140 150 160 170 180 

MEDGS LVIAV^RHMSIMRMRVHSNLTKKRVTLLILLVWAIAIFMGAVPTLGWNCLCNISACSSLA 

I II MM M MMM M MM MIMM I Ml IMM MMM Ml MM M IMM Ml 

HEDG5 -3 - 4 LVIAV^RHMSIMRMRVlISNLTKKRvTLLILLVWAIAIFMGAVPTLGWNCLCNISACSSLA 

130 140 150 160 170 180 

190 200 210 220 230 240 

MEDG5 PIYSRSYLIFWTVSNLLAFFIMVAVYVRIYMWKRKTNVTjS 

' IMM IMMIMI I MMIM IMMM MMI MMM MM III IMIMMIII II I 

HEDG5-3 . 4 PIYSRSYLVFWTVSNLMAFLI^mArYLRIYVYVKRKTNVLSPHTSGSISRRRTPMKL^aCT 

190 200 210 220 230 240 

250 260 270 280 290 300 

MEDGS VTOTVLGAFWCWTPGLWLLLDGI^CKQCNV I I YS YKDEDM 

I MM III 1 1 Mill M MM I Ml IM I Ml I MM M Ml M II I M I M M Ml M I 

HEDG5 - 3 - 4 VKIVLGAFWCWTPGLVVliLLDGLNCRQCGVQHVKRWFLLL^ 

250 260 270 280 290 300 



310 320 330 340 350 

MEDG5 YNTMRKMICCALQDSNTERRPSRNPSTIHSRSETGSQYLEDSISQGPVCNKNGS 

MMM MM M I MMM IM: 1 1 1 41 II M 1 1 M M I MM: | 

HEDG5 -3 . 4 YGTMKKMICCFSQE-NPERRPSRIPSTVLSRSDTGSQYIEDSISQGAVCNKSTS 

310 320 330 340 350 
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SEQUENCE LISTINGS 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

NAME: ALLELIX B ^PHARMACEUTICALS INC. 
STREET: 6850 Goreway Drive 
CITY: Mississauga 
PROVINCE: Ontario 
COUNTRY: Canada 
POSTAL CODE: L4V 1V7 
TELEPHONE: (905) 677-0831 
FACSIMILE: (905) 677-9595 

(ii) TITLE OF INVENTION: MAMMALIAN EDG-5 RECEPTOR HOMOLOGS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) NAME: Orange Chari Pillay 

(B) STREET: Suite 3600, P.O. Box 190 

Toronto Dominion Bank Tower 
Toronto -Dominion Centre 

(C) CITY: Toronto 

(D) PROVINCE: Ontario 

(E) COUNTRY : Canada 

(F) ZIP: M5K 1H6 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: DOS EDITOR 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/CA98/ 01193 

(B) FILING DATE: 24 -DEC- 1998 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) COUNTRY: U.S.A. 

(B) APPLICATION NUMBER: 08/997,803 

(C) FILING DATE: 24-DEC-1997 

(D) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Santosh K. Chari 

(B) FIRM: Orange Chari Pillay 

(C) REFERENCE NUMBER: 8700213-0006 

(D) TELEPHONE: (416) 868-3457 

(E) FACSIMILE: (416) 364-7910 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AAYTRSATMT STAAYYTGCG TGCGA 25 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/ KEY : Modified Site 

(B) LOCATIONS 

(D) OTHER INFORMATION: N is Inosine 

<ix) FEATURE: 

(A) NAME /KEY: Modified Site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: N is Inosine 

(ix) FEATURE: 

(A) NAME/ KEY : Modified Site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: N is Inosine 

(ix) FEATURE: 

(A) NAME/KEY: Modified Site 

(B) LOCATION: 22 

(D) OTHER INFORMATION: N is Inosine 

( ix > FEATURE : 

(A) NAME /KEY: Modified Site 

(B) LOCATION:25 

(D) OTHER INFORMATION: N is Inosine 

(ix) FEATURE: 

(A) NAME /KEY : Modified Site 

(B) LOCATION: 28 

(D) OTHER INFORMATION: N is Inosine 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CTGNYKWTTC ATNAWNMMRT ANAYNAYNGG RTT 33 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 634 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

AAC ACT GGC CCG GTG TCG AAA ACG TTG ACC GTC AAC CGC TGG TTC CTC 4 8 

Asn Thr Gly Pro Val Ser Lys Thr Leu Thr Val Asn Arg Trp Phe Leu 
1 5 10 15 

CGC CAG GGG CTC CTA GAC ACC AGC CTG ACT GCC TCC CTG GCC AAT TTG 96 
Arg Gin Gly Leu Leu Asp Thr Ser Leu Thr Ala Ser Leu Ala Asn Leu 
20 25 30 

CTG GTT ATT GCT GTG GAA AGA CAC ATG TCN ATC ATG AGG ATG AGA GTC 144 
Leu Val He Ala Val Glu Arg His Met Ser He Met Arg Met Arg Val 
35 40 45 

CAC AGC AAC TTG ACC AAA AAG CGG GTG ACG CTG CTC ATT CTG CTG GTG 192 
His Ser Asn Leu Thr Lys Lys Arg Val Thr Leu Leu He Leu Leu Val 
50 55 60 

TGG GCC ATC GCC ATC TTC ATG GGG GCC GTC CCC ACN CTG GGA TGG AAT 240 
Trp Ala He Ala He Phe Met Gly Ala Val Pro Thr Leu Gly Trp Asn 
65 70 75 80 

TGC CTC TGC AAC ATC TCG GCC TGC TCT TCT CTG GCT CCC ATT TAG AGT 288 

Cys Leu Cys Asn He Ser Ala Cys Ser Ser Leu Ala Pro He Tyr Ser 

85 90 95 

AGG AGT TAC CTC ATT TTC TGG ACT GTG TCC AAC CTC CTG GCC TTC TTC 336 

Arg Ser Tyr Leu He Phe Trp Thr Val Ser Asn Leu Leu Ala Phe Phe 
100 105 110 

ATC ATG GTG GCG GTA TAC GTA CGC ATC TAC ATG TAT GTT AAA AGG AAA 384 
He Met Val Ala Val Tyr Val Arg He Tyr Met Tyr Val Lys Arg Lys 
115 120 125 

ACC AAC GTC TTA TCT CCA CAC ACC AGT GGC TCC ATC AGC CGC CGG AGG 432 
Thr Asn Val Leu Ser Pro His Thr Ser Gly Ser He Ser Arg Arg Arg 
130 135 140 

GCT CCC ATG AAG CTA ATG AAG ACA GTG ATG ACC GTC TTA GGC GCC TTC 480 
Ala Pro Met Lys Leu Met Lys Thr Val Met Thr Val Leu Gly Ala Phe 
145 150 155 160 

GTG GTG TGC TGG ACC CCG GGT CTG GTG GTT CTG CTG CTG GAC GGC CTG 528 
Val Val Cys Trp Thr Pro Gly Leu Val Val Leu Leu Leu Asp Gly Leu 
165 170 175 
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AAC TGC AAG CAG TGT AAC GTG CAA CAC GTG AAG NGC TGG TTC CTG CTG 576 
Asn Cys Lys Gin Cys Asn Val Gin His Val Lys Xaa Trp Phe Leu Leu 
180 185 190 

CTC GCA CTG CTC AAC TCC GTC ATG AAC CCC CTC ATC TAC TGC CGC TCT 624 
Leu Ala Leu Leu Asn Ser Val Met Asn Pro Leu lie Tyr Cys Arg Ser 
195 200 205 

CCN NAC TTT CCA TGG 639 
Pro Xaa Phe Pro Trp 
210 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TTTTTACTCG AGATTTGCTG GTTATTGCTG TGGAAAG 37 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TTTTTTCTAG ACGGTCATCA CTGTCTTCAT TAGCTTC 37 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
ATGCGGCTGC ATAGCAACCT GACCAAAAAG 30 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATCCGCAGGT ACACCACAAC CATGATGAGG 30 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 
TTTTGAGCAA GTTCAGCCTG GTTAAGT 27 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGGCTTATGA GTATTTCTTC CAGGGTA 27 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGGTAGTCGG TACCTCTAGA GCAAGTTCAG CC 32 
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(2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATAACAGAGG ATCCTCGAGT ATTTCTTCCA G 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 261.. 1322 

(ix) FEATURE: 

(A) NAME/KEY: Termination codon 

(B) LOCATION: 1320.. 1322 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CACCTTCCTA ACCTGAGCGG CCTAGCCTGG GAAACAAACA ATTAAAATGT GCGCTAAATG 60 

CTGTGGTAGG AGGTCAGGGG CTATGTCCTG GACCAAAGGA CATTTGCACT GAGACCTGAC 120 

ACTTCAGGTC TTCAACTCCC TTGATGGGAG TTAGCCAGAA CGGGCTTAGA AACAGCAATT 180 

GATGGCTTAG TGACTGATTT TACAAATGAT ATTTGTTTCT TCTTTAAATT TCTTTCTAGG 24 0 

ATGTTCACTT CTTCTCCACA ATG AAT GAG TGT CAC TAT GAC AAG CAC ATG 290 

Met Asn Glu Cys His Tyr Asp Lys His Met 
215 220 

GAC TTT TTT TAT AAT AGG AGC AAC ACT GAT ACT GTC GAT GAC TGG ACA 33 8 

Asp Phe Phe Tyr Asn Arg Ser Asn Thr Asp Thr Val Asp Asp Trp Thr 
225 230 235 

GGA ACA AAG CTT GTG ATT GTT TTG TGT GTT GGG ACG TTT TTC TGC CTG 386 
Gly Thr Lys Leu Val He Val Leu Cys Val Gly Thr Phe Phe Cys Leu 
240 245 250 

TTT ATT TTT TTT TCT AAT TCT CTG GTC ATC GCG GCA GTG ATC AAA AAC 434 
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Phe He Phe Phe Ser Asn Ser Leu Val He Ala Ala Val He Lys Asn 
255 260 265 

AGA AAA TTT CAT TTC CCC TTT TAC TAC CTG TTG GCT AAT TTA GCT GCT 4 82 

Arg Lys Phe His Phe Pro Phe Tyr Tyr Leu Leu Ala Asn Leu Ala Ala 
270 275 280 285 

GCC GAT TTC TTC GCT GGA ATT GCC TAT GTA TTC CTG ATG TTT AAC ACA 530 
Ala Asp Phe Phe Ala Gly He Ala Tyr Val Phe Leu Met Phe Asn Thr 
290 295 300 

GGC CCA GTT TCA AAA ACT TTG ACT GTC AAC CGC TGG TTT CTC CGT CAG 578 
Gly Pro Val Ser Lys Thr Leu Thr Val Asn Arg Trp Phe Leu Arg Gin 
305 310 " 315 

GGG CTT CTG GAC AGT AGC TTG ACT GCT TCC CTC ACC AAC TTG CTG GTT 626 
Gly Leu Leu Asp Ser Ser Leu Thr Ala Ser Leu Thr Asn Leu Leu Val 
320 325 330 

ATC GCC GTG GAG AGG CAC ATG TCA ATC ATG AGG ATG CGG GTC CAT AGC 674 
He Ala Val Glu Arg His Met Ser lie Met Arg Met Arg Val His Ser 
335 340 345 

AAC CTG ACC AAA AAG AGG GTG ACA CTG CTC ATT TTG CTT GTC TGG GCC 722 
Asn Leu Thr Lys Lys Arg Val Thr Leu Leu He Leu Leu Val Trp Ala 
350 355 360 365 

ATC GCC ATT TTT ATG GGG GCG GTC CCC ACA CTG GGC TGG AAT TGC CTC 770 
He Ala He Phe Met Gly Ala Val Pro Thr Leu Gly Trp Asn Cys Leu 
370 375 380 

TGC AAC ATC TCT GCC TGC TCT TCC CTG GCC CCC ATT TAC AGC AGG AGT 818 
Cys Asn He Ser Ala Cys Ser Ser Leu Ala Pro He Tyr Ser Arg Ser 
385 390 395 

TAC CTT GTT TTC TGG ACA GTG TCC AAC CTC ATG GCC TTC CTC ATC ATG 866 
Tyr Leu Val Phe Trp Thr Val Ser Asn Leu Met Ala Phe Leu He Met 
400 405 410 

GTT GTG GTG TAC CTG CGG ATC TAC GTG TAC GTC AAG AGG AAA ACC AAC 914 
Val Val Val Tyr Leu Arg lie Tyr Val Tyr Val Lys Arg Lys Thr Asn 
415 420 425 

GTC TTG TCT CCG CAT ACA AGT GGG TCC ATC AGC CGC CGG AGG ACA CCC 962 
Val Leu Ser Pro His Thr Ser Gly Ser lie Ser Arg Arg Arg Thr Pro 
430 435 440 ~ " 445 

ATG AAG CTA ATG AAG ACG GTG ATG ACT GTC TTA GGG GCG TTT GTG GTA 1010 
Met Lys Leu Met Lys Thr Val Met Thr Val Leu Gly Ala Phe Val Val 
450 455 ' 460 

TGC TGG ACC CCG GGC CTG GTG GTT CTG CCC CTC GAC GGC CTG AAC TGC 1058 
Cys Trp Thr Pro Gly Leu Val Val Leu Pro Leu Asp Gly Leu Asn Cys 
465 470 475 

AGG CAG TGT GGC GTG CAG CAT GTG AAA AGG TGG TTC CTG CTG CTG GCG 1106 
Arg Gin Cys Gly Val Gin His Val Lys Arg Trp Phe Leu Leu Leu Ala 
480 485 490 

CTG CTC AAC TCC GTC GTG AAC CCC ATC ATC TAC TCC TAC AAG GAC GAG 1154 
Leu Leu Asn Ser Val Val Asn Pro He He Tyr Ser Tyr Lys Asp Glu 
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495 



500 



505 



GAC ATG TAT GGC ACC ATG AAG AAG ATG ATC TGC TGC TTC TCT CAG GAG 
Asp Met Tyr Gly Thr Met Lys Lys Met lie Cys Cys Phe Ser Gin Glu 
510 515 520 525 



1202 



AAC CCA GAG AGG CGT CCC TCT CGC ATC CCC TCC ACA GTC CTC AGC AGG 
Asn Pro Glu Arg Arg Pro Ser Arg lie Pro Ser Thr Val Leu Ser Arg 
530 535 540 



1250 



AGT GAC ACA GGC AGC CAG TAC ATA GAG GAT AGT ATT AGC CAA GGT GCA 
Ser Asp Thr Gly Ser Gin Tyr lie Glu Asp Ser lie Ser Gin Gly Ala 
545 550 555 



1298 



GTC TGC AAT AAA AGC ACT TCC TAA ACTCTGGATG CCTCTYGGCC CACCCAGGCC 
Val Cys Asn Lys Ser Thr Ser 

560 565 



1352 



TCCTCTGGGA AAAGAGCTGT TAAGAATGAT TACCTGTCTC TAACAAAGCC CATGTACAGT 1412 
GTTATTTGAG GTCTCCATTA ATCACTGCTA GATTTCTTTA AAAAATTTTT TTTCATAGTT 1472 



(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1356 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 

GAATTCGCGG CCGCGTCGAC GTTCACTTCT CCACAATGAA TGAGTGTCAC TATGACAAGC 60 

ACATGGACTT TTTTTATAAT AGGAGCAACA CTGATACTGT CGATGACTGG ACAGGAACAA 120 

AGCTTGTGAT TGTTTTGTGT GTTGGGACGT TTTTCTGCCT GTTTATTTTT TTTTCTAATT 180 

CTCTGGTCAT CGCGGCAGTG ATCAAAAACA GAAAATTTCA TTTCCCCTTT TACTACCTGT 240 

TGGCTAATTT AGCTGCTGCC GATTTCTTCG CTGGAATTGC CTATGTATTC CTGATGTTTA 300 

ACACAGGCCC AGTTTCAAAA ACTTTGACTG TCAACCGCTG GTTTCTCCGT CAGGGGCTTC 360 

TGGACAGTAG CTTGACTGCT TCCCTCACCA ACTTGCTGGT TATCGCCGTG GAGAGGCACA 420 

TGTCAATCAT GAGGATGCGG GTCCATAGCA ACCTGACCAA AAAGAGGGTG ACACTGCTCA 480 

TTTTGCTTGT CTGGGCCATC GCCATTTTTA TGGGGGCGGT CCCCACACTG GGCTGGAATT 540 

GCCTCTGCAA CATCTCTGCC TGCTCTTCCC TGGCCCCCAT TTACAGCAGG AGTTACCTTG 600 

TTTTCTGGAC AGTGTCCAAC CTCATGGCCT TCCTCATCAT GGTTGTGGTG TACCTGCGGA 660 

TCTACGTGTA CGTCAAGAGG AAAACCAACG TCTTGTCTCC GCATACAAGT GGGTCCATCA 720 



TAAAAGCATG GGCAGTAAAG AGAGGACCTG CTGCATTTAG 



AGAAAGCACA G 



1523 
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GCCGCCGGAG GACACCCATG AAGCTAATGA AGACGGTGAT GACTGTCTTA GGGGCGTTTG 780 

TGGTATGCTG GACCCCGGGC CTGGTGGTTC TGCCCCTCGA CGGCCTGAAC TGCAGGCAGT 840 

GTGGCGTGCA GCATGTGAAA AGGTGGTTCC TGCTGCTGGC GCTGCTCAAC TCCGTCGTGA 900 

ACCCCATCAT CTACTCCTAC AAGGACGAGG ACATGTATGG CACCATGAAG AAGATGATCT 960 

GCTGCTTCTC TCAGGAGAAC CCAGAGAGGC GTCCCTCTCG CATCCCCTCC ACAGTCCTCA 1020 

GCAGGAGTGA CACAGGCAGC CAGTACATAG AGGATAGTAT TAGCCAAGGT GCAGTCTGCA 1080 

ATAAAAGCAC TTCCTAAACT CTGGATGCCT CTGGCCCACC CAGGCCTCCT CTGGGAAAAG 1140 

AGCTGTTAAG AATGATTACC TGTCTCTAAC AAAGCCCATG TACAGTGTTA TTTGAGGTCT 1200 

CCATTAATCA CTGCTAGATT TCTTTAAAAA ATTTTTTTTC ATAGTTTAAA AGCATGGGCA 1260 

GTAAAGAGAG GACCTGCTGC ATTTAGAGAA AGCACAGGTC GACGCGGCCG CGAATTCTTT 1320 

TGCTTTTTAC CCTGGAAGAA ATACTCGAGC ATGCAT 1356 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asn Glu Cys His Tyr Asp Lys His Met Asp Phe Phe Tyr Asn Arg 
15 10 15 

Ser Asn Thr Asp Thr Val Asp Asp Trp Thr Gly Thr Lys Leu Val He 
20 25 30 

Val Leu Cys Val Gly Thr Phe Phe Cys Leu Phe He Phe Phe Ser Asn 
35 40 45 

Ser Leu Val He Ala Ala Val He Lys Asn Arg Lys Phe His Phe Pro 
50 55 60 

Phe Tyr Tyr Leu Leu Ala Asn Leu Ala Ala Ala Asp Phe Phe Ala Gly 
65 70 75 80 

He Ala Tyr Val Phe Leu Met Phe Asn Thr Gly Pro Val Ser Lys Thr 
85 90 95 

Leu Thr Val Asn Arg Trp Phe Leu Arg Gin Gly Leu Leu Asp Ser Ser 
100 105 110 

Leu Thr Ala Ser Leu Thr Asn Leu Leu Val He Ala Val Glu Arg His 
115 120 125 

Met Ser He Met Arg Met Arg Val His Ser Asn Leu Thr Lys Lys Arg 
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130 135 140 

Val Thr Leu Leu lie Leu Leu Val Trp Ala lie Ala lie Phe Met Gly 
145 150 155 160 

Ala Val Pro Thr Leu Gly Trp Asn Cys Leu Cys Asn lie Ser Ala Cys 
165 170 175 

Ser Ser Leu Ala Pro lie Tyr Ser Arg Ser Tyr Leu Val Phe Trp Thr 
180 185 190 

Val Ser Asn Leu Met Ala Phe Leu lie Met Val Val Val Tyr Leu Arg 
195 200 205 

lie Tyr Val Tyr Val Lys Arg Lys Thr Asn Val Leu Ser Pro His Thr 
210 215 220 

Ser Gly Ser lie Ser Arg Arg Arg Thr Pro Met Lys Leu Met Lys Thr 
225 230 235 240 

Val Met Thr Val Leu Gly Ala Phe Val Val Cys Trp Thr Pro Gly Leu 
245 250 255 

Val Val Leu Pro Leu Asp Gly Leu Asn Cys Arg Gin Cys Gly Val Gin 
260 265 270 

His Val Lys Arg Trp Phe Leu Leu Leu Ala Leu Leu Asn Ser Val Val 
275 280 285 

Asn Pro lie He Tyr Ser Tyr Lys Asp Glu Asp Met Tyr Gly Thr Met 
290 295 300 

Lys Lys Met He Cys Cys Phe Ser Gin Glu Asn Pro Glu Arg Arg Pro 
305 310 315 320 

Ser Arg He Pro Ser Thr Val Leu Ser Arg Ser Asp Thr Gly Ser Gin 
325 330 335 

Tyr He Glu Asp Ser He Ser Gin Gly Ala Val Cys Asn Lys Ser Thr 
340 345 350 

Ser 



(2) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Thr Gly Pro Val Ser Lys Thr Leu Thr Val Asn Arg Trp Phe Leu 
15 10 15 

Arg Gin Gly Leu Leu Asp Thr Ser Leu Thr Ala Ser Leu Ala Asn Leu 
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20 25 30 

Leu Val He Ala Val Glu Arg His Met Ser He Met Arg Met Arg Val 
35 40 45 

His Ser Asn Leu Thr Lys Lys Arg Val Thr Leu Leu He Leu Leu Val 
50 55 6o 

Trp Ala He Ala He Phe Met Gly Ala Val Pro Thr Leu Gly Trp Asn 
65 70 75 80 

Cys Leu Cys Asn lie Ser Ala Cys Ser Ser Leu Ala Pro He Tyr Ser 
85 90 95 

Arg Ser Tyr Leu lie Phe Trp Thr Val Ser Asn Leu Leu Ala Phe Phe 
100 105 no 

He Met Val Ala Val Tyr Val Arg He Tyr Met Tyr Val Lys Arg Lys 
115 12 0 125 

Thr Asn Val Leu Ser Pro His Thr Ser Gly Ser lie Ser Arg Arg Arg 
130 135 140 

Ala Pro Met Lys Leu Met Lys Thr Val Met Thr Val Leu Gly Ala Phe 
145 150 155 ' i 60 

Val Val Cys Trp Thr Pro Gly Leu Val Val Leu Leu Leu Asp Gly Leu 
165 170 175 

Asn Cys Lys Gin Cys Asn Val Gin His Val Lys Xaa Trp Phe Leu Leu 
180 185 190 

Leu Ala Leu Leu Asn Ser Val Met Asn Pro Leu lie Tyr Cys Arg Ser 
195 200 205 

Pro Xaa Phe Pro Trp 
210 
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